Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Patrick Geoffray
Bogdan Costescu wrote: long as it fits in one page. At another time, the switch was more likely to drop large frames under high load (maybe something to do with internal memory management), so the 9000bytes frames worked most of the time while the 1500bytes ones worked all the time... This is a

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Patrick Geoffray
Rahul Nabar wrote: What was your tool to measure this latency? Just curious. I like to use netperf to measure performance over Sockets, including latency (it's there but not obvious). For OS-bypass interfaces, your favorite MPI benchmark is fine. Patrick

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Patrick Geoffray
Rahul Nabar wrote: The TSO and LRO are only relevant to TCP though, aren't they? I am using RDMA so that shouldn't matter. Maybe I am wrong. TSO/LRO applies to TCP, but you can have the same technique with different protocol, USO for UDP Send Offload for example. RDMA is everything you want

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Patrick Geoffray
Rahul Nabar wrote: Thanks! So I could push it beyond 9000 as well? 1500 Bytes is the standard MTU for Ethernet, anything larger is out of spec. The convention for a larger MTU is Jumbo Frames at 9000 Bytes, and most switches support it these days. Some hardware even support Super Jumbo Frame

Re: [Beowulf] Performance degrading

2009-12-15 Thread Gus Correa
Hi Glen, Jorg Glen: Yes, you are right about MPICH1/P4 starting extra processes. However, I wonder if that is what is happening to Jorg, of if what he reported is just plain CPU oversubscription. Jorg: Do you use MPICH1/P4? How many processes did you launch on a single node, four or five? Glen

[Beowulf] PXE/TFTP and Xen Kernel Issues

2009-12-15 Thread Jess Cannata
I'm having a problem booting Xen kernels via PXE. I want to boot a machine via PXE that will then host Xen virtual machines. The client machine PXE boots, receives the pxelinux.0 file, and then grabs the Xen kernel (vmlinuz-2.6.18-164.6.1.el5xen). However, it can never load the Xen kernel. On t

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Alex Chekholko
On Tue, 15 Dec 2009 13:22:56 -0800 Greg Lindahl wrote: > On Tue, Dec 15, 2009 at 12:49:23PM -0500, Scott Atchley wrote: > > > You can test it by using the size parameter with ping: > > > > $ ping -s > > > > If they all drop, then you have exceeded the MTU of some device. > > [lind...@greg-des

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Greg Lindahl
On Tue, Dec 15, 2009 at 12:49:23PM -0500, Scott Atchley wrote: > You can test it by using the size parameter with ping: > > $ ping -s > > If they all drop, then you have exceeded the MTU of some device. [lind...@greg-desk b]$ ping -s 6 rich-desk PING rich-desk (64.13.159.69) 6(60028) by

Re: [Beowulf] Performance degrading

2009-12-15 Thread Glen Beane
On 12/15/09 2:36 PM, "Gus Correa" wrote: If you have single quad core nodes as you said, then top shows that you are oversubscribing the cores. There are five nwchem processes are running. It has been a very long time, but wasn't that normal behavior for mpich under certain instances? If

Re: [Beowulf] Performance degrading

2009-12-15 Thread Gus Correa
Hi Jorg If you have single quad core nodes as you said, then top shows that you are oversubscribing the cores. There are five nwchem processes are running. In my experience, oversubscription only works in relatively light MPI programs (say the example programs that come with OpenMPI or MPICH). R

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Rahul Nabar
On Tue, Dec 15, 2009 at 11:49 AM, Scott Atchley wrote: > On Dec 14, 2009, at 12:57 PM, Alex Chekholko wrote: > >> Set it as high as you can; there is no downside except ensuring all >> your devices are set to handle that large unit size.  Typically, if the >> device doesn't support jumbo frames, i

[Beowulf] [hpc-announce] CFP: 24th International Conference on Supercomputing (ICS'10) - 4 weeks remaining

2009-12-15 Thread Darren
[Please accept our apologies if you have received this announcement multiple times] * CALL FOR PAPERS - Submission deadline in 4 weeks 24th International Conference on Supercomputing (ICS'10)

[Beowulf] [hpc-announce] CfP: Special Issue of JPDC on "Data Intensive Computing", Submission: One month from Today

2009-12-15 Thread Surendra Byna
Dear Colleagues: The paper submission deadline for the Special Issue of Journal of Parallel and Distributed Computing (JPDC) on "Data Intensive Computing" is a month from Today (January 15th 2010). We welcome your submissions. We appreciate sharing this announcement with anyone who might be int

[Beowulf] [hpc-announce] Extended Deadline: Workshop on Large-Scale Parallel Processing (LSPP'10)

2009-12-15 Thread Darren
[Please accept our apologies if you receive multiple copies] - Call for papers: Workshop on LARGE-SCALE PARALLEL PROCESSING to be held in conjunction with IEEE International Parallel and Distributed Processing Sym

[Beowulf] [hpc-announce] Call For Papers: Intl. Workshop on Parallel Programming Models and Systems Software for HEC (P2S2)

2009-12-15 Thread Yong Chen
CALL FOR PAPERS === Third International Workshop on Parallel Programming Models and Systems Software for High-end Computing (P2S2) Sept. 13th, 2010 To be held in conjunction with ICPP-2010: The 39th International Conference on Parallel Processing, Sept. 13-16, 2010, San Diego, CA, U

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Scott Atchley
On Dec 14, 2009, at 12:57 PM, Alex Chekholko wrote: Set it as high as you can; there is no downside except ensuring all your devices are set to handle that large unit size. Typically, if the device doesn't support jumbo frames, it just drops the jumbo frames silently, which can result in odd

Re: [Beowulf] Performance tuning for Jumbo Frames

2009-12-15 Thread Alex Chekholko
On Sun, 13 Dec 2009 21:26:52 -0600 Rahul Nabar wrote: > On Sat, Dec 12, 2009 at 10:40 AM, Patrick Geoffray wrote: > > Rahul, > > > > Rahul Nabar wrote: > >> > >> I have seen a considerable performance boost for my codes by using > >> Jumbo Frames. But are there any systematic tools or strategies

[Beowulf] Performance degrading

2009-12-15 Thread Jörg Saßmannshausen
Dear all, I am scratching my head but apart from getting splinters into my fingers I cannot find a good answer for the following problem: I am running a DFT program (NWChem) in parallel on our cluster (AMD Opterons, single quad cores in the node, 12 GB of RAM, Gigabit network) and at certain st

[Beowulf] Ich bitte um Feedback bzgl. MPI / please I need your feedback

2009-12-15 Thread Eugen Leitl
- Forwarded message from Rolf Rabenseifner - From: Rolf Rabenseifner Date: Tue, 15 Dec 2009 15:42:50 +0100 (CET) To: eu...@leitl.org Subject: Ich bitte um Feedback bzgl. MPI / please I need your feedback Sehr geehrte Damen und Herren, zur Weiterentwicklung des Message Passing Interface