Michael Huntingdon wrote:
At 04:13 PM 8/15/2006, Mike Davis wrote:
I'm not 100% sure about that, Mark. I care about big-A Administration.
I care about showing departments what resources are actually
available. I care about what is the most efficient use of limited
University resources. When I meet with researchers they often say that
they had no idea that there were 500+ processors dedicated to research
here.
I know that other people have the same issues. Another is the funding
model issue. Which is best: overhead, direct, or central budget? Or how
about knowing what resources we each provide to our users? Does a given
organization focus on hardware support, software support, or both?
Those are some of the big-A issues. Here is one that is both big-A and
small-a.
Running one of the new Sun X4100s with both dual-core processors at
100% uses <270 watts (as determined with a Kill A Watt meter). That is
big-A because it means that we can be more efficient in our use of AC
and power. It is small-a for the same reasons. For example, spinning up
a V20z uses 250 watts with both processors at full power. I can't
discuss some of my application-specific performance due to license
constraints, but I can say that I like the X4100 in general for
computational physics and chemistry.
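To put rough numbers on that, here is a back-of-the-envelope sketch in
Python. The wattages are the measurements above; the electricity rate
and the cooling multiplier are assumptions I've made up for
illustration, so substitute your own campus figures:

    # Rough annual at-the-wall cost per node, including cooling overhead.
    # Wattages are from the Kill A Watt measurements above; the rate and
    # cooling factor are assumed placeholders, not measured values.
    HOURS_PER_YEAR = 8760
    RATE_USD_PER_KWH = 0.10   # assumed utility rate
    COOLING_FACTOR = 1.5      # assumed: ~0.5 W of AC per W of compute

    def annual_cost(watts: float) -> float:
        """Annual power-plus-cooling cost for a node drawing `watts`."""
        kwh = watts * HOURS_PER_YEAR / 1000
        return kwh * RATE_USD_PER_KWH * COOLING_FACTOR

    for name, watts, cores in [("X4100 (2x dual-core)", 270, 4),
                               ("V20z (2x single-core)", 250, 2)]:
        print(f"{name}: {watts / cores:.1f} W/core, "
              f"~${annual_cost(watts):.0f}/yr at full load")

On those assumptions the X4100 lands around 67 W per core versus 125 W
per core for the V20z, which is exactly the big-A efficiency point.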
I often scratch my head wondering how certain decisions are made at the
"central IT" level, so a perspective from the campus that involves both
performance and up-time (plug-in-the-wall) costs is refreshing. We far
too often see a complete disconnect between the two, which very often
means that none of the invested parties (at the
NSF/NIH/state/federal level) ever really enjoys the value of each dollar
they invest.
My HPC efforts now involve both cases of 'A', as we're trying to
change the HPC paradigm on campus from almost solely SMP to a
combination of memory paradigms to better serve the research community.
And the 'we' is not directly in the Administration chain but is a
couple of on-campus players who are frustrated with the status quo. I
look at infrastructure costs as well as per-node costs, maintenance,
and potential down-time versus spares. Also, I now have to consider
networking costs, both on campus and in our commodity, Internet2, and
NLR dealings. Bandwidth is no longer free, although a lot of researchers
still think it is.
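As a sketch of how those line items add up per node, in Python, with
every dollar figure a placeholder assumption rather than a real quote:

    # Back-of-the-envelope per-node TCO over an assumed 3-year window.
    # All dollar amounts are illustrative placeholders, not vendor quotes.
    YEARS = 3

    costs = {
        "hardware": 3500,                 # assumed purchase price per node
        "maintenance": 300 * YEARS,       # assumed annual support contract
        "power_and_cooling": 355 * YEARS, # from the wattage sketch earlier
        "network_share": 200 * YEARS,     # assumed slice of commodity/I2/NLR links
        "spares": 400,                    # assumed spares pool vs. down-time hedge
    }

    total = sum(costs.values())
    for item, usd in sorted(costs.items()):
        print(f"{item:18s} ${usd}")
    print(f"per-node total over {YEARS} years: ${total} (~${total / YEARS:.0f}/yr)")

The point isn't the particular numbers; it's that the network and power
lines are now the same order of magnitude as maintenance, so leaving
them out of a per-node comparison skews the answer.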
I appreciate that Sun may be suggesting (these days) that their systems
are more environmentally friendly; however, given the
price/performance/environmental/support picture...and the really crazy
extended down time associated with engineering issues, logic, at least
for some, makes distancing significant IT investment from Sun a decision
that follows from very few conversations.
Interesting: I've started buying a lot more Sun, as their costs are
comparable to the blade cluster prices I've gotten from other vendors.
We have seen as good or better up-times with X2100 through X4200 series
hardware, and time-to-repair has been stellar with the on-site Sun
support. When we had problems flashing a BIOS upgrade and couldn't
recover it ourselves (our last real Sun downtime), they had a new
motherboard in within 6 hours and the system back up an hour later. I
haven't suffered from the engineering issues you seem to have encountered.
My point comes honestly from your comments, which we hold dear...the
growing number of research systems/CPUs on campus affects each and every
one of us on a daily basis. Having spent this week at the LSS event at
Stanford, I am ever more convinced of how diverse the needs are...and of
the number of possible solutions. So that must be a big-A approach with
a huge tilt in a not-so-big-A direction.
Another issue that is both: which submission systems are we using, and
why? Same questions that affect both administration and Administration.
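For concreteness, here is a minimal sketch of what programmatic
submission looks like, assuming a PBS/Torque-style scheduler with qsub
on the PATH; the job name, resource request, solver binary, and input
file are made-up examples:

    # Minimal job-submission sketch for a PBS/Torque-style scheduler.
    # The job name, resource line, and solver invocation are hypothetical.
    import subprocess
    import tempfile

    JOB_SCRIPT = """#!/bin/sh
    #PBS -N chem_run
    #PBS -l nodes=4:ppn=2,walltime=12:00:00
    cd "$PBS_O_WORKDIR"
    mpirun ./my_solver input.dat
    """

    with tempfile.NamedTemporaryFile("w", suffix=".pbs", delete=False) as f:
        f.write(JOB_SCRIPT)
        script_path = f.name

    # qsub prints the assigned job id on success
    result = subprocess.run(["qsub", script_path],
                            capture_output=True, text=True, check=True)
    print("submitted:", result.stdout.strip())

Whether you run Torque, SGE, or LSF changes the directives but not the
shape, and which one we standardize on is exactly the
administration/Administration question.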
Mike Davis
Mark Hahn wrote:
beowulf traffic itself is "noise"? If you are thinking of a "list for
university deans" or members of research support offices or
departmental
...
administerable and accountable should they get audited) -- then yeah, I
think a new list or other venue would be very useful.
yes. the overlap is minimal, I believe - I'd say the two approaches
are even inimical. someone who is primarily interested in big-A
Administration will have values opposed to mine as a technologist.
as a random pot-shot, big-a people tend to have great faith in
negotiating special purchasing relationships with a vendor, or
believe that integration
is the high-road to success (or an end in itself). I know, OTOH,
that a vendor who makes a good desktop may make the world's worst compute
nodes, and that, for instance, the service requirements are nearly
opposite.
here's my general conclusion about central-IT efforts: if the idea
(centralized storage, whatever) is so good,
people will beg to use it. if you have to force people to use it,
you are simply wrong in some way (perhaps subtly).
Just to touch on this, I'm in general agreement, although our big-A
folks have just negotiated a big-iron acquisition for a decent price.
It'll go
into the central IT core for shared computing resources... I'll see how
well it works out. The central IT HPC folks were the proximate cause of
me building my first cluster...
gerry
--
Gerry Creager -- [EMAIL PROTECTED]
Texas Mesonet -- AATLT, Texas A&M University
Cell: 979.229.5301 Office: 979.458.4020 FAX: 979.862.3983
Office: 1700 Research Parkway Ste 160, TAMU, College Station, TX 77843
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf