David Mathog wrote:
Have any of you CUDA folks produced diagnostic programs you run during
"burn in" of new GPU based systems, in order to weed out problem units
before putting them into service?

A while ago I wrote a CUDA implementation of a subset of the Memtest86+ algorithms,to test the reliability of the consumer GPUs used by our distributed computing project, GPUGRID. You can get them here:

http://ccs.chem.ucl.ac.uk/~matt/cudamemtest.tgz

That said, we never really used it in anger (most of the stability problems we were having turned out to be due to 'factory-overclocked' GPUs) so YMMV.

MJH


--
Matt Harvey                     Email: m.j.har...@imperial.ac.uk
HPC Systems Support Analyst
Imperial College London
                                PGP Key ID: 0xD234302E

http://www.imperial.ac.uk/ict/services/highperformancecomputing

_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to