Book review: High Performance Linux Clusters <i>by Joseph D Sloan</i>

Book review: High Performance Linux Clusters by Joseph D Sloan


Like distributed computing, clusters are a hot topic in the current computing climate. The reason is simple, with the explosion of Linux and cheaper components it’s actually become quite simple and inexpensive to put together a relatively high-powered cluster. Driving the cluster production is an increased need for computing power as applications are developed for different situations. For example, graphics rendering—often used in films and TV, as well as custom applications—and computer-based modelling of complex environment such as weather systems or DNA, all require vast quantities of computing power.

The book’s cover The book’s cover

In High Performance Linux Clusters (published by O’Reilly), Joseph D Sloan provides a comprehensive guide to building and deploying cluster solutions. Starting from the mechanics of hardware and operating systems right through to deployment and administration tools, the book provides everything you should need to get started.

A brief look through the book shows you that the scope of the title is wide, but it’s also not lacking information. There’s content on setting up your cluster, managing it, and profiling and debugging it to achieve the best performance.

Sloan has an easy to read but highly effective writing style that handles the complicated topics with apparent ease

The contents

The approach of the book is a straightforward lead through from the basics to the more advanced topics. The introductory chapter should be required reading for anybody who isn’t already familiar with clusters or indeed many other types of high performance computing (HPC). It would probably be a good idea for those that think they know about HPC to read this too as it serves as an excellent introduction not only to clusters but also single and multi-processor computing and the basics of distributed computing too.

Sloan has an easy to read but highly effective writing style that handles the complicated topics with apparent ease and this is best demonstrated with the next few chapters where he looks at the theoretical side of clusters; the planning and decisions that should be made before you start plugging machines together and installing software. By Chapter 4 we’re into the finer points of selecting and installing Linux.

Part II looks at some of the “turnkey” solutions for getting your Linux cluster up and running: openMosix, OSCAR and Rocks. As with chapters in the previous section, the information supplied is extensive with little left to guess work or the imagination if this is the route you want to take.

For more custom and personalized clusters, Part III offers information on specific tools you’ll need such as development libraries, management software and the PVFS parallel filesystem.

The final part—part IV—looks at programming for your cluster. This sections concentrates on using MPI—one of the best and most practical solutions for writing cluster software. Again, the approach is simple and exceedingly practical, starting with a look at a serial application designed to run on a single computer, and then moving on to the same problem but written with a cluster in mind. Finally, the author finishes up with information on how to debug and profile parallel applications.

Who’s this book for?

Anybody who is using, or thinking of using, a cluster should find this book useful. There’s a lot of information here from the design of your cluster right up to the implementation and usage of the cluster. Administrators, managers and developers should get a lot of use out of the book.

It’s also going to appeal to the hobbyist and enthusiast who wants to build their own small cluster. There is nothing elitist about the book and the subject matter—clusters on Linux on commodity hardware—means that many people may decide to build their cluster in their garage.

Pros

If you are interested in clusters—even from a purely theoretical standpoint—this book is a good guide. Aside from knowledge of computing and Linux, the author doesn’t expect you to have any previous experience with clusters. He doesn’t talk down to you either and that makes the information and techniques covered in the book easy to use and understand.

It’s also hard to imagine that you will need any other information that isn’t already covered in the book—there is so much detail, in such a wide range that the book acts as a one stop shop for cluster computing. As if that wasn’t enough, the author has also included an impressive list of other references, books and websites, just in case you want more specific details.

Cons

I really can’t suggest any reasons why you shouldn’t buy this book. There are one or two inevitable typos, but nothing that detracts or misleads, and there are a few websites I was surprised to see missing. But really, I’m scraping the barrel on what is an excellent book that I would recommend to anybody with even a passing interest in clusters.

Title High Performance Linux Clusters
Author Joseph D Sloan
Publisher O’Reilly
ISBN 0596005709
Year 2005
Pages 350
CD included No
Mark (out of 10) 10

In short

Category: 
License: 

Author information

Martin Brown's picture

Biography

Martin “MC” Brown is a member of the documentation team at MySQL and freelance writer. He has worked with Microsoft as an Subject Matter Expert (SME), is a featured blogger for ComputerWorld, a founding member of AnswerSquad.com, Technical Director of Foodware.net and, and has written books on topics as diverse as Microsoft Certification, iMacs, and free software programming.

Most forwarded

Interview with Dave Mohyla, of DTIDATA

Dave Mohyla is the president and founder of dtidata.com, a hard drive recovery facility based in Tampa, Florida.

TM: Where are you based? What does your company do?
DTI Data recovery is based in South Pasadena, Florida which is a suburb of Tampa. We have been here for over 10 years. We operate a bio-metrically secured class 100 clean room where we perform hard drive recovery on all types of hard disks, from laptop hard drives to multi drive RAID systems.

Anybody up to writing good directory software?

Since the very beginning, directories (of any kind) have had a very central role in the internet. (I have recently grown fond of Free Web Directory. Even Slashdot can be considered a directory: a collection of great news and invaluable user-generated comments. As far as software is concerned, doing a quick search on Google about software directories will return the free (as in freedom) software directories like Savannah, SourceForge, Freshmeat and so on, followed by shareware and freeware sites such as FileBuzz, PCWin Download Center and All Freeware (great if you're looking for shareware and freeware, but definitely less comprehensive than their free-as-in-freedom counterparts).

Interview with Mark Shuttleworth

Mark Shuttleworth is the founder of Thawte, the first Certification Authority to sell public SSL certificates. After selling Thawte to Verisign, Mark moved on to training as an astronaut in Russia and visiting space. Once he got back he founded Ubuntu, the leading GNU/Linux distribution. He agreed on releasing a quick interview to Free Software Magazine.

Is better education the key to finding better software?

I read David Jonathon's article Anybody Up To Writing Good Directory Software? the other day, which got me thinking about software directories in general. As David mentioned, many of the software directories one finds when doing a quick google search are free as in beer, not as in freedom. But what interests me is the software directories that already exist, providing a combination of both free as in beer software, and open source software. Sites such as Freeware Downloads and Shareware Download don't advertise themselves as providing free as in liberty software, but each of them have a good selection of open source software available... if you know where to look.

Most emailed

Free Open Document label templates

If you’ve ever spent hours at work doing mailings, cursed your printer for printing outside the lines on your labels, or moaned “There has got to be a better way to do this,” here’s the solution you’ve been looking for. Working smarter, not harder! Worldlabel.com, a manufacture of labels offers Open Office / Libre Office labels templates for downloading in ODF format which will save you time, effort, and (if you want) make really cool-looking labels

Creating a user-centric site in Drupal

A little while ago, while talking in the #drupal mailing list, I showed my latest creation to one of the core developers there. His reaction was "Wow, I am always surprised what people use Drupal for". His surprise is somehow justified: I did create a site for a bunch of entertainers in Perth, a company set to use Drupal to take over the world with Entertainers.Biz.

Update: since writing this article, I have updated the system so that the whole booking process happens online. I will update the article accordingly!

So, why, why do people and companies develop free software?

More and more people are discovering free software. Many people only do so after weeks, or even months, of using it. I wonder, for example, how many Firefox users actually know how free Firefox really is—many of them realise that you can get it for free, but find it hard to believe that anybody can modify it and even redistribute it legally.

When the discovery is made, the first instinct is to ask: why do they do it? Programming is hard work. Even though most (if not all) programmers are driven by their higher-than-normal IQs and their amazing passion for solving problems, it’s still hard to understand why so many of them would donate so much of their time to creating something that they can’t really show off to anybody but their colleagues or geek friends.

Sure, anybody can buy laptops, and just program. No need to get a full-on lab or spend thousands of dollars in equipment. But... is that the full story?

Fun articles

Santa Claus - the most successful open source project

It dawned on me the other day, as I was shopping for the dozens of gifts it seems I have to buy every December, that Santa Claus is the most successful open source project in history. (Bridget @ Illiterarty would agree with that). Santa Claus is essentially a marketing development that is embodied by everyone who stuffs a sock, gives a gift, hosts a dinner or wishes Merry Christmas over the holiday season.

Most emailed

Editorial

When I first started thinking about Free Software Magazine, I was feeling enthusiastic about the dream. I had Dave, Gianluca, and Alan willing to help me, I had established members of the free software community willing to help me out, I had writers volunteering their time and energy for free, and I had a generous offer from OpenHosting for servers, all before I'd proved myself. There was a sense of excitement in the air, and I thought maybe, just maybe, I could make this work.

Free Software Magazine uses Apollo project management software and CRM for its everyday activities!