Re: [Beowulf] Memory stress testing tools.

2010-12-07 Thread Prentice Bisbal
That was the first thing I looked into. memtest86 supports upto 64 GB of RAM. My system has 128 GB. :( I found prime95/gimps through a wikipedia page. I'm giving it a go now. http://www.mersenne.org/freesoft/#newusers On 12/07/2010 01:05 PM, Mcmillan, Scott A wrote: > memtest86 > > -Orig

Re: [Beowulf] Memory stress testing tools.

2010-12-07 Thread David Mathog
Prentice Bisbal wrote: > When the system seems > stable, I let the users back on it, and sure enough, they get it to > start reporting SBEs in short order. Sounds like you already have a good tool for triggering memory errors on that system - your user's code. Regards, David Mathog mat...@cal

[Beowulf] Memory stress testing tools.

2010-12-07 Thread Prentice Bisbal
Dear Beowulfers, Can any of you recommend a good RAM stress testing tool? I have a server with 128GB of RAM that keeps reporting single-bit errors. Every time this happens, I reseat the DIMMS or swap them around, and then run some large MPI jobs with I hope stress the RAM. Sometimes this produ