On 12/02/07, Alan McKinnon <[EMAIL PROTECTED]> wrote:

On Monday 12 February 2007, Matt Richards wrote:
> I wouldn't of thought it would be the RAM not from behavior like that
> but it does sound like a overheating issue, I have had computers just
> power off because they get too hot.

The thing with memtest is that it does (mostly) predictable tests, which
can detect consistent errors with memory chips, or somewhat random
errors with memory.

BUT, it doesn't help at all with detecting didgy hardware in other
areas. It doesn't give you any valid tests for what happens when the
machine gets hot for example. Compiling a kernel (or openoffice.org) 10
times in a loop does do that.


It does look like it's a problem with the PSU or (God forbid) the cpu. ("God
forbid" because new CPU means new motherboard also).


So, I would say that you have shown that your memory chips are not
likely to be faulty, but you have not yet proven that the cpu (or the
rest of the machine) isn't getting hot and overloading.

Jeff, if you were paying me to diagnose your problem, I'd be telling you
to swap out the psu and test some more, then disassemble the
cpu/heatsink combination and replace the thermal grease and test some
more. If it then fails, the tests are still not conclusive so I would
want to forcefully cool the machine (with an external fan etc) while
putting it through some stress tests (like a looooooong compile).


Well at the moment it doesn't even survive the early stages of a short
compile (the ./configure stage) so hopefully if any progress at all is made
at diagnosing the problem, it'll be quick. Unfortunately if I were paying
you to diagnose the problem I'd also have enough money to buy spare PSU's -
but I'm not!

Jeff

--
Now, did you hear the news today?
They say the danger's gone away
But I can hear the marching feet
Moving into the street

Adapted from Genesis, "Land of Confusion"

http://latedeveloper.org.uk

Reply via email to