Hello,
Traditionally, a parallel application is run as follows:
>>> export FOO=foo
>>> mpirun -np 2 -machinefile mymachinefile my_parallel_app [app options]
(To be known by all the nodes of my cluster, the environment variable
FOO has to be set on all the nodes or exported on the nodes b
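
One workaround that is sometimes used, given here only as a minimal sketch: start a small wrapper script in place of the application, so that each process sets FOO itself before the real program runs. The script name run_with_env.sh is hypothetical, and this assumes that mpirun starts the same wrapper on every node and that the script is reachable at the same path everywhere; whether that holds depends on the MPI implementation and device in use.

```shell
#!/bin/sh
# run_with_env.sh (hypothetical name): set the variable locally,
# then hand control to the real application.
FOO=foo
export FOO

# Replace this shell with the command given as arguments, so the
# application inherits FOO and receives any arguments the launcher
# appended. With no arguments, exec does nothing and the script ends.
exec "$@"
```

It would then be launched as: mpirun -np 2 -machinefile mymachinefile ./run_with_env.sh my_parallel_app [app options]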
Hi all,
I use MPICH-1.2.5.2 to generate and run an FEM parallel application.
During a parallel run, one process can crash, leaving the other
processes running, so OS commands have to be used to kill these zombies.
So, does anyone have a solution to avoid zombies after a failed
parallel run: can