Jon Forrest wrote:
Joe Landman wrote:

... so I see you have never used an interprocedural analysis (-ipa) switch :)

Allows you to do things like, I dunno, inline one whole routine inside another ...

I've never used this, but from your description I don't
see how it leads to larger text sizes at runtime. After all, if
routine A is 10 bytes and routine B is 20 bytes, it would seem
that they collectively take 30 bytes whether they stand alone
or one is inlined inside the other. I might not be
understanding this right, though.

More like N*20 bytes ... use the routine more than once :)
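A toy C sketch of why (my illustration, with made-up names): each call site gets its own copy of the inlined body, so a ~20-byte routine called three times costs roughly 3*20 bytes of text, not 20.

/* scale() compiles to roughly 20 bytes of machine code */
static inline int scale(int x) {
    return 3 * x + 7;
}

int use_everywhere(int a, int b, int c) {
    /* With whole-program inlining (-ipa style), each of these three
     * calls becomes its own copy of scale()'s body in the text
     * segment: ~3*20 bytes, instead of 20 bytes plus three short
     * call instructions. */
    return scale(a) + scale(b) + scale(c);
}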


Usually leads to much larger program text sizes.

This said, I have seen very large programs from RISC days hitting well more than 1 GB of text. I haven't played with any recently though.

Let's say this is about right. Do you see such programs getting
even larger in the future?

Sadly, yes.

Why is sharing expensive in performance? It might take a little
overhead to set up and manage, but why is having multiple virtual
addresses map to the same physical memory expensive?

Contention. Memory hot spots. Been there, done that. We are about to do this all over again (collectively).

Naively I would think that text memory hot spots would be a good
thing, because then all the benefits of caching would kick in.
There would be no cache coherence overhead since text is read-only.
Why is this a bad thing?

Ohhhh.... You *really* don't want your system brought to its knees over false sharing. It's a great way to turn a large expensive machine into a very slow large expensive machine. Listen to Greg Lindahl, and he'll likely point to this as one of the great fallacies of 'why shared memory is better than distributed memory' :) (not shoving words into his mouth, so if he has changed his mind or thinks differently ... that's ok)

Imagine you are a processor, and you have written to a location in RAM. Now your cache line is dirty and waiting in the queue to be flushed out. In your parallel program, along comes someone else who really, really wants to read that cache line. This forces you to a) flush it now, and b) mark that line as clean. Then the next CPU gets that cache line, does its write, and whammo, some other CPU wants to do the same thing to it that you did.

Sadly enough, this is a common error in shared memory programming. Think of it as a bunch of loops operating in parallel, all trying to update the same counter at once.

Each update has to wait until it can grab the cache line, and then it proceeds. The more updaters you have, the more contention you have for that resource. Your performance scales as 1/N rather than (constant)*N.
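Here's a minimal pthreads sketch of that pattern (my illustration, not anyone's production code). Each thread bumps its own counter, but the counters all sit on one cache line, so every increment drags the line away from the other CPUs ... the false sharing mentioned above:

#include <pthread.h>
#include <stdio.h>

#define NTHREADS 4
#define NITERS   10000000L

/* all four counters land on the same 64-byte cache line */
static long count[NTHREADS];

static void *worker(void *arg) {
    long id = (long)arg;
    for (long i = 0; i < NITERS; i++)
        count[id]++;            /* dirties the shared line every pass */
    return NULL;
}

int main(void) {
    pthread_t t[NTHREADS];
    for (long i = 0; i < NTHREADS; i++)
        pthread_create(&t[i], NULL, worker, (void *)i);
    for (long i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    long total = 0;
    for (int i = 0; i < NTHREADS; i++)
        total += count[i];
    printf("total = %ld\n", total);   /* correct answer, lousy scaling */
    return 0;
}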

Now do this with a page at a time, say a buffer. Like, I dunno, an InfiniBand MPI buffer, or a 10 GbE MPI buffer. Throw more CPUs behind this buffer, and force them to get in line to shoot data over to their counterparts. The IB or 10 GbE resource becomes contended for, and as you increase Ncpu, the contention and performance loss get worse and worse (this is basically what Doug Eadline is worried about).

There are ways you can work around some of this stuff. Share-nothing is one way, though this is hard to do at the OS level, where you share I/O devices etc. Allocate some private memory queues, a scheduler, and other bits (you have to do this with CUDA systems and most accelerators to get reasonable performance).
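For the counter sketch above, the share-nothing fix is just padding (again, my illustration): give each thread a counter on its own cache line, then reduce at the end. Same loop, different layout:

#include <pthread.h>
#include <stdio.h>

#define NTHREADS  4
#define NITERS    10000000L
#define CACHELINE 64

/* pad each counter out to a full cache line, so no two writers
 * ever touch the same line */
struct padded {
    long count;
    char pad[CACHELINE - sizeof(long)];
};
static struct padded counters[NTHREADS];

static void *worker(void *arg) {
    long id = (long)arg;
    for (long i = 0; i < NITERS; i++)
        counters[id].count++;   /* private line, no ping-pong */
    return NULL;
}

int main(void) {
    pthread_t t[NTHREADS];
    for (long i = 0; i < NTHREADS; i++)
        pthread_create(&t[i], NULL, worker, (void *)i);
    for (long i = 0; i < NTHREADS; i++)
        pthread_join(t[i], NULL);
    long total = 0;
    for (int i = 0; i < NTHREADS; i++)
        total += counters[i].count;
    printf("total = %ld\n", total);   /* same answer, now it scales */
    return 0;
}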

I know you might postulate that 32-bit text is effectively the CS equivalent of "c" (the speed of light) in physics ... you may approach it asymptotically but never actually get there ... but unlike in physics, there isn't really an underlying reason why you might not get there.

--
Joseph Landman, Ph.D
Founder and CEO
Scalable Informatics LLC,
email: [email protected]
web  : http://www.scalableinformatics.com
       http://jackrabbit.scalableinformatics.com
phone: +1 734 786 8423 x121
fax  : +1 866 888 3112
cell : +1 734 612 4615
_______________________________________________
Beowulf mailing list, [email protected] sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf
