Prentice Bisbal wrote:
> The server is a Dell PowerEdge R815 with 4 8-Core AMD processors and 128
> GB of RAM.
If the erroneous memory locations are moving around in memory without
correlation to the DIMMs then the next most likely culprits are a
marginal power supply, CPU, or motherboard, in p
Prentice,
Thanks for filling in some details. What you say makes complete sense to
me.
Is it the case that frigga has seen similar stress with no SBE errors? If
so, I agree it seems like something else is going on besides bad DIMMs. To
test that, if you can schedule simultaneous downtime on the
does anyone know of easy to run code that would swallow the cpu/memory
on the chassis but also a tesla card? A lot of the tools i typically
used in the past that have been ported to GPU's don't seem to use up
much of the memory, or use all the GPU constantly. I'm running
through NAMD at the momen
David,
Thanks for the e-mail due to it's length, I'm not including it in my
reply, which I know is normally bad mailing list etiquette.
The server is a Dell PowerEdge R815 with 4 8-Core AMD processors and 128
GB of RAM.
I installed two identical servers at the same time, named frigga and
odin
David,
--
Prentice
___
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf