[mvapich-discuss] Errors spawning processes with mpirun_rsh

Jaidev Sridhar sridharj at cse.ohio-state.edu
Mon Feb 23 09:46:55 EST 2009


Hi Rafael,

The message indicates that the application terminated  with a non zero 
error code or crashed after launching. Can you check if it leaves any 
core files? You may need to set ulimit to  unlimited. For example, add 
ulimit -c unlimited in your ~/.bashrc.

Can you also give us details of the cluster and any options you've 
enabled with MVAPICH / MVAPICH2?

-Jaidev

On 02/23/2009 04:45 AM, Rafael Arco Arredondo wrote:
> Hello,
> 
> I'm having some issues with mpirun_rsh within both MVAPICH 1.1 and
> MVAPICH2 1.2p1. As I commented in another email to the list some time
> ago, mpirun_rsh is the only mechanism we can use to create MPI processes
> in our configuration.
> 
> The command issued is:
> mpirun_rsh -ssh -np 2 -hostfile ./machines ./mpihello
> 
> And the error reported by mpirun_rsh is:
> 
> Exit code -5 signaled from localhost
> MPI process terminated unexpectedly
> Killing remote processes...DONE
> 
> We also got this on some of our machines:
> 
> Child exited abnormally!
> Killing remote processes...DONE
> 
> mpihello is a simple hello world and this happens even when the
> processes are launched on localhost only.
> 
> OFED 1.2 is used as the underlying Infiniband libraries, and both
> MVAPICH and MVAPICH2 were compiled with the OpenFabrics/Gen2 single-rail
> option, without XRC as indicated in the user's guide for OFED libraries
> prior to version 1.3.
> 
> Any help will be kindly appreciated.
> 
> Thank you in advance,
> 
> Rafa
> 
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
> 



More information about the mvapich-discuss mailing list