Re: [Beowulf] Memory stress testing tools

2010-12-10 Thread David Mathog
Prentice Bisbal wrote: > The server is a Dell PowerEdge R815 with 4 8-Core AMD processors and 128 > GB of RAM. If the erroneous memory locations are moving around in memory without correlation to the DIMMs then the next most likely culprits are a marginal power supply, CPU, or motherboard, in p

Re: [Beowulf] Memory stress testing tools.

2010-12-10 Thread David Kewley
Prentice, Thanks for filling in some details. What you say makes complete sense to me. Is it the case that frigga has seen similar stress with no SBE errors? If so, I agree it seems like something else is going on besides bad DIMMs. To test that, if you can schedule simultaneous downtime on the

[Beowulf] tesla benchmarking

2010-12-10 Thread Michael Di Domenico
does anyone know of easy to run code that would swallow the cpu/memory on the chassis but also a tesla card? A lot of the tools i typically used in the past that have been ported to GPU's don't seem to use up much of the memory, or use all the GPU constantly. I'm running through NAMD at the momen

Re: [Beowulf] Memory stress testing tools.

2010-12-10 Thread Prentice Bisbal
David, Thanks for the e-mail due to it's length, I'm not including it in my reply, which I know is normally bad mailing list etiquette. The server is a Dell PowerEdge R815 with 4 8-Core AMD processors and 128 GB of RAM. I installed two identical servers at the same time, named frigga and odin

Re: [Beowulf] Memory stress testing tools.

2010-12-10 Thread Prentice Bisbal
David, -- Prentice ___ Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing To change your subscription (digest mode or unsubscribe) visit http://www.beowulf.org/mailman/listinfo/beowulf