Re: [Beowulf] ECC Scrub, which setting?

2008-05-19 Thread Jim Lux
Quoting Mark Hahn <[EMAIL PROTECTED]>, on Mon 19 May 2008 08:47:46 PM PDT: It is currently set to Basic, which scrubs every 5.24 ms. You'll have to look in the manual to find out what that means -- it's probably "do a small amount of scrubbing every 5.24 ms". And you have I expect it's the i

Re: [Beowulf] ECC Scrub, which setting?

2008-05-19 Thread Mark Hahn
It is currently set to Basic, which scrubs every 5.24 ms. You'll have to look in the manual to find out what that means -- it's probably "do a small amount of scrubbing every 5.24 ms". And you have I expect it's the interval between cacheline-sized (64B) scrubs. as such, I think it's much too

Re: [Beowulf] TOE on Linux?

2008-05-19 Thread Greg Lindahl
On Mon, May 19, 2008 at 06:42:51PM -0400, Mark Hahn wrote: > the RDMA meme always puzzles me. easy to see why it's a seductive > idea, but I always run up against the problem of exactly how it's > supposed to be used - are there any moderately well-known applications > that specifically use RDM

Re: [Beowulf] ECC Scrub, which setting?

2008-05-19 Thread Charlie Peck
version=3.1.3 X-Spam-Checker-Version: SpamAssassin 3.1.3 (2006-06-01) on quark.cs.earlham.edu On May 19, 2008, at 7:04 PM, Greg Lindahl wrote: It must suck when you lose tenure for publishing a wrong paper. If that's all it took to loose tenure there would be a lot more o

Re: [Beowulf] ECC Scrub, which setting?

2008-05-19 Thread Greg Lindahl
On Mon, May 19, 2008 at 09:09:34AM -0700, David Mathog wrote: > It is currently set to > Basic, which scrubs every 5.24 ms. You'll have to look in the manual to find out what that means -- it's probably "do a small amount of scrubbing every 5.24 ms". And you have to plug in your actual memory siz

Re: [Beowulf] TOE on Linux?

2008-05-19 Thread Mark Hahn
course not - Cisco sells both!), RDMA and TOE kept coming repeatedly as the RDMA meme always puzzles me. easy to see why it's a seductive idea, but I always run up against the problem of exactly how it's supposed to be used - are there any moderately well-known applications that specifically

Re: [Beowulf] TOE on Linux?

2008-05-19 Thread Prentice Bisbal
Prentice Bisbal wrote: > Does anyone know of any network cards/drivers that support TOE (TCP > Offload Engine) for Linux? A hardware vendor just told me that Linux > does not support the TOE features of *any* network card. > > Given Linux's strong presence in HPC and the value of having TOE in a >

[Beowulf] [p2s2-announce] Reminder: P2S2 Workshop Deadline Coming Up

2008-05-19 Thread Pavan Balaji
We would like to remind you all that the P2S2 Workshop deadline is coming up in a few days (May 21st). We look forward to receiving paper submissions from you. Please note in the CFP below that the actual workshop is moved from the first day of the ICPP conference (Sep. 8th) to the last day (Sep

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Perry E. Metzger
"Jeffrey B. Layton" <[EMAIL PROTECTED]> writes: > Evidently you don't know jack about real codes. Evidently. But I do know how to be polite, so I'll bow out of this conversation. Perry ___ Beowulf mailing list, Beowulf@beowulf.org To change your subscr

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Jeffrey B. Layton
Perry E. Metzger wrote: "Jeffrey B. Layton" <[EMAIL PROTECTED]> writes In any case, let me note the most important rule: if your CPUs aren't doing work most of the time, you're not allocating resources properly. If the task is really I/O bound, there is no point in having more CPU than I/O can p

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Perry E. Metzger
"Jeffrey B. Layton" <[EMAIL PROTECTED]> writes: >> Third, you could be doing lots of file i/o to legitimate data >> files. Here again, it is possible that if the files are small enough >> and your access patterns are repetitive enough that increasing your >> RAM could be enough to make everything

[Beowulf] ECC Scrub, which setting?

2008-05-19 Thread David Mathog
I finally managed to get into the BIOS in the broken IBM x3455 (the problem turned out to be a bad stick of Crucial memory in the first memory slot), and once there, discovered that not only was ECC enabled, but that this machine has a plethora of ECC settings. These were "Disabled, Basic, Good, S

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Joe Landman
Jeffrey B. Layton wrote: Here comes the $64 question - how do you benchmark the IO portion of your code so you can understand whether you need a parallel file system, what kind of connection do you need from a client to the storage, etc. This is a difficult problem and one in which I have an

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Jeffrey B. Layton
Perry E. Metzger wrote: "Jeffrey B. Layton" <[EMAIL PROTECTED]> writes: Here comes the $64 question - how do you benchmark the IO portion of your code so you can understand whether you need a parallel file system, what kind of connection do you need from a client to the storage, etc. This is

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Perry E. Metzger
"Jeffrey B. Layton" <[EMAIL PROTECTED]> writes: > Here comes the $64 question - how do you benchmark the IO portion of > your code so you can understand whether you need a parallel file > system, what kind of connection do you need from a client to the > storage, etc. This is a difficult problem a

Re: [Beowulf] Computation on the head node

2008-05-19 Thread Jeffrey B. Layton
Joe Landman wrote: Perry E. Metzger wrote: checking. A very fast RAID array may be in order -- or it may be completely unnecessary. One can't know without understanding one's application intimately, and that requires testing. Of course. But there are quite a few people/groups on this list wit