Re: [Beowulf] Anybody going with AMD Istanbul 6-core CPU's?

2009-08-27 Thread Jason Clinton
On Thu, Aug 20, 2009 at 11:01 AM, Steve Cousins wrote: > I haven't seen anybody here talking about the 6-core AMD CPU's yet. Is > anybody trying these out? Anybody have real-world comparisons (say WRF) of > scalability of a 12-core system vs. a 16 thread Nehalem system? I ran a benchmark awhile ag

Re: [Beowulf] Odd AMD quad core SuperMicro power off issues

2009-07-06 Thread Jason Clinton
On Fri, Jul 3, 2009 at 12:17 AM, Chris Samuel wrote: >> Well we've been gradually replacing the Barcelona chips >> with Shanghai (same clockspeed) and we are yet to see a >> power off on a Shanghai node! > > Since I wrote that we have seen far fewer with 2.3GHz > Shanghai (2376, a 75W part), *but*

Re: [Beowulf] ECC Memory and Job Failures

2009-04-24 Thread Jason Clinton
On Fri, Apr 24, 2009 at 12:49 AM, John Hearns wrote: > 2009/4/23 Nifty Tom Mitchell : > > On Thu, Apr 23, 2009 at 04:45:08PM +0100, Huw Lynes wrote: > > > > IMO Running on a large cluster without multiple bit detection and a > minimum of one bit > > correction ECC is silly. > > > > Further running

Re: [Beowulf] Issue in finding libraries on the unix path

2009-04-16 Thread Jason Clinton
On Mon, Apr 13, 2009 at 4:34 PM, Francesco Pietra < francesco.pie...@accademialucchese.it> wrote: > Question: could you please clarify how to implement the above "-rpath > linkage option"? With gcc and icc, you do "-Wl,-rpath=/path/to/your/libraries" > Also, is there any hope to satisfy the f9

Re: [Beowulf] Nehalem memory configs

2009-04-13 Thread Jason Clinton
On Sun, Apr 12, 2009 at 10:43 PM, Jason Clinton < jclin...@advancedclustering.com> wrote: > No, it does not degrade performance as long as the CAS latencies of both > DIMM types are identical and you would not get a clock-down by using that > quantity of RAM ranks. We have been

Re: [Beowulf] Nehalem memory configs

2009-04-13 Thread Jason Clinton
On Wed, Apr 8, 2009 at 2:49 PM, Peter Kjellstrom wrote: > Can anyone confirm that mixing different sizes of dimms, even when keeping > to > a by-three-symmetric configuration, actually does degrade performance? > > That is, first hand information that config 1 will be slower than config 2: > > 1:

Re: [Beowulf] Weird CentOS Install Problem

2008-08-11 Thread Jason Clinton
On Fri, Aug 8, 2008 at 9:57 PM, Jon Forrest <[EMAIL PROTECTED]> wrote: > Matt Lawrence wrote: >> >> On Fri, 8 Aug 2008, Jon Forrest wrote: >> >>> What's weird about this is that the root file system starts >>> on cylinder 1, as confirmed by the fdisk command. This is >>> using a brand new SuperMicr

Re: [Beowulf] copying big files (Henning Fehrmann)

2008-08-11 Thread Jason Clinton
On Fri, Aug 8, 2008 at 11:11 AM, David Mathog <[EMAIL PROTECTED]> wrote: > Henning Fehrmann <[EMAIL PROTECTED]> wrote: > >> Coping a big file onto all nodes in a cluster is a rather >> common problem. I would have thought that there might be a >> standard tool for distributing the files in an effic

Re: [Beowulf] Seeing ECC errors since upgraded from Opteron 246 to 275

2008-08-06 Thread Jason Clinton
On Sat, Aug 2, 2008 at 6:57 AM, Paulo Afonso Lopes <[EMAIL PROTECTED]> wrote: > Thanks, Mark > >>> So I have 2 DL145-G2 nodes with 2 single-core 246 / 4GB each, and 2 >>> DL145-G2 nodes with 2 dual-core 275 / 4GB each. >> >> it's worth making sure you have current bios installed. >> > Not the lates

Re: [Beowulf] Can one Infiniband net support MPI and a parallel file system?

2008-08-06 Thread Jason Clinton
On Tue, Aug 5, 2008 at 4:25 PM, Gus Correa <[EMAIL PROTECTED]> wrote: > Is anybody using Infiniband to provide both > MPI connection and parallel file system services on a Beowulf cluster? > > I thought to have a storage node that would > serve a parallel file system to the beowulf nodes over IB >

Re: [Beowulf] Barcelona hardware error: how to detect

2008-06-10 Thread Jason Clinton
On Thu, Jun 5, 2008 at 11:39 AM, Mikhail Kuzminsky <[EMAIL PROTECTED]> wrote: > In message from Mark Hahn <[EMAIL PROTECTED]> (Thu, 5 Jun 2008 11:57:28 > -0400 (EDT)): > >> To be more exact, Rev. B2 of Opteron 2350 - is it for CPU stepping w/error >>> or w/o error ? >>> >> >> AMD, like Intel, does

Re: [Beowulf] Barcelona hardware error: how to detect

2008-06-10 Thread Jason Clinton
On Thu, Jun 5, 2008 at 1:09 PM, Mikhail Kuzminsky <[EMAIL PROTECTED]> wrote: > In message from Mark Hahn <[EMAIL PROTECTED]> (Thu, 5 Jun 2008 13:55:01 > -0400 (EDT)): > >> I'm mystified by this: B2 was broken, so using it without the bios >> workaround is just a mistake or masochism. the workarou

Re: [Beowulf] OFED/IB for FC8

2008-06-10 Thread Jason Clinton
On Thu, Jun 5, 2008 at 4:38 AM, Rainer Finocchiaro < [EMAIL PROTECTED]> wrote: > Hi Michael, > > Greg Lindahl schrieb: > >> All the OFED rpm's for FC6 installed on FC8 without difficulty, except for >>> opensm-3.0.3-0.ppc64.rpm >>> >> >> This is the cause of most of your subsequent problems. Witho

Re: [Beowulf] OFED/IB for FC8

2008-06-10 Thread Jason Clinton
On Thu, Jun 5, 2008 at 10:38 AM, Jason Clinton < [EMAIL PROTECTED]> wrote: > ls | grep -oP .+?\(\?=.x86_64\\.rpm\) | xargs rpm -e > Of course, replace "x86_64" with "ppc64" if indeed that is what you installed. _