Re: [Beowulf] massive parallel processing application required

Gerry Creager Thu, 01 Feb 2007 05:10:03 -0800

Mitchell Wisidagamage wrote:

Please don't fall into the trap of thinking "e-Science" requires a tie to the Globus Toolkit to be valid.
I do not think this (anymore). I queried Matthew Haynos from IBM, who's an expert in this area, some time ago as I'm new to grid computing. The silly questions are from me :o) Answers are his.
Because at the moment distributed computing is only popular in
academic research and highly specialized parts of the industry... at least
that's what I think. Any professional and personal comments from your
experience?
Not true. Distributed computing is more and more mainstream. I think too that you are looking at distributed computing perhaps too narrowly. Even if you are referring to supercomputing, witness that more and more of the Top 500 supercomputing sites are commercial (as opposed to academic or public institutions).
Anyhow, I just read it again and you stated that "Grid computing [is] becoming
more of a de facto standard for distributed computing in enterprises".

May I ask why you think that?
I would say b/c of the growing ubiquity of scale-out computing (lots of machines, lots of resources, etc.). What's happening here is that scheduling, etc. is going from the machine into the network. People no longer know where things are going to run with hundreds / thousands of blade processors. This is a sea change. People used to say "run this piece of work on this machine"; now it's just "run this work, I have no idea where." I've written an article series for IBM's grid site on developerWorks.
Check out: http://www-128.ibm.com/developerworks/search/searchResults.jsp?searchType=1&pageLang=&displaySearchScope=dW&searchSite=dW&lastUserQuery1=perspectives+on+grid&lastUserQuery2=&lastUserQuery3=&lastUserQuery4=&query=perspectives+on+grid+haynos&searchScope=dW
particularly the "Next-generation distributed computing" article for a primer. I think you'll find the five or so articles in the series interesting.

I've read the article series and it is interesting. And I'm not completely given over to anti-grid sentiment. The problem remains, however, and it is embodied by a colleague recounting his experience in running an ocean circulation model: "We only had a 13% slowdown running this as a grid application when compared to our local cluster."

Now, there are several things to consider that go unsaid here. One is the degree of coupling in the code. Another is the size of the datasets that have to be moved to the various sites to facilitate operations. Some codes will perform well when distributed broadly, while others will die a horrid death waiting for pieces of the result to come back from that P3 installation in Outer Geekdom. Some will suffer simply from communications latency. Others will just continue to chug along. By way of illustration, we benchmarked my MM5 semi-production run of 72 forecast hours for 3 domains of increasing resolution across the United States. To complete in the same timeframe as a locally submitted job, we found we had to double the number of processors when it was distributed out to the "grid". This is an extreme example, of course, and not one I propose to repeat anytime soon... It's much easier to run MM5 and WRF locally and not have to worry quite so much about resource reservation and odd processors failing mid-run.
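
The three factors named above (coupling, dataset movement, latency) can be put into a rough back-of-envelope model. The sketch below is only an illustration: the function names, the simple additive cost model, and every number in it are assumptions made up for this example, not measurements from the MM5 or ocean-model runs.

```python
# Toy cost model (an assumption, not a measured result):
#   runtime = compute time + per-exchange communication cost + data staging

def estimated_runtime(cpu_seconds, procs, exchanges, latency_s,
                      msg_bytes, bandwidth_bytes_per_s, staged_bytes=0.0):
    """Estimate wall-clock seconds for a run under the toy model."""
    compute = cpu_seconds / procs  # assumes perfect scaling of the compute part
    # Each exchange pays one latency plus the transfer time of one message.
    communication = exchanges * (latency_s + msg_bytes / bandwidth_bytes_per_s)
    # Input data that must be shipped to a remote site before the run starts.
    staging = staged_bytes / bandwidth_bytes_per_s
    return compute + communication + staging


def slowdown(local_s, grid_s):
    """Fractional slowdown of the grid run relative to the local run."""
    return grid_s / local_s - 1.0


# Hypothetical network figures: a cluster interconnect versus a wide-area link.
LOCAL = dict(latency_s=50e-6, bandwidth_bytes_per_s=1e9)
GRID = dict(latency_s=50e-3, bandwidth_bytes_per_s=1e7)


def compare(exchanges, staged_bytes):
    """Slowdown of a hypothetical 64-processor job when moved to the grid."""
    local = estimated_runtime(36000.0, 64, exchanges, msg_bytes=1e6, **LOCAL)
    grid = estimated_runtime(36000.0, 64, exchanges, msg_bytes=1e6,
                             staged_bytes=staged_bytes, **GRID)
    return slowdown(local, grid)


if __name__ == "__main__":
    # Tightly coupled code: many exchanges per run.
    print("tightly coupled:", compare(exchanges=10000, staged_bytes=1e9))
    # Loosely coupled code: only a handful of exchanges.
    print("loosely coupled:", compare(exchanges=10, staged_bytes=1e9))
```

With these made-up inputs the tightly coupled case is dominated by wide-area communication, while the loosely coupled case mostly pays for staging the input data. Real codes overlap communication with computation and rarely scale perfectly, so treat this as a way to reason about which term dominates rather than as a predictor.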

--
Gerry Creager -- [EMAIL PROTECTED]
Texas Mesonet -- AATLT, Texas A&M University        
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf
