[mvapich-discuss] Help! Problems Slurm and MVAPICH2

José Manuel Molero jmlero at hotmail.com
Thu Sep 13 03:58:59 EDT 2012


Hello,

We have a new cluster with an Infiniband network, and I think that Slurm and MVAPICH2 would be the best option in this case.

I have configured SLURM 2.3.2 on Ubuntu Server and its works.

Now I tried to install MVAPICH2 1.8, with the following:

./configure --with-pm=none --with-pmi=slurm ;  make ; make install  (in the front end and all the compute nodes)

But it dosent work.

I compile using :

mpicc file.c -o file -lpmi -L/usr/include/slurm/

and then:

 srun -N2 file

And the result is:

In: PMI_Abort(1, Fatal error in MPI_Init:
Other MPI error
)
In: PMI_Abort(1, Fatal error in MPI_Init:
Other MPI error
)
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
srun: error: node17: task 1: Exited with exit code 1
slurmd[node16]: *** STEP 102.0 KILLED AT 2012-09-13T09:56:18 WITH SIGNAL 9 ***
srun: error: node16: task 0: Exited with exit code 1
slurmd[node16]: *** STEP 102.0 KILLED AT 2012-09-13T09:56:18 WITH SIGNAL 9 ***



And the output of mpiname -a

MVAPICH2 1.8 Mon Apr 30 14:56:40 EDT 2012 ch3:mrail

Compilation
CC: gcc    -DNDEBUG -DNVALGRIND -O2
CXX: c++   -DNDEBUG -DNVALGRIND -O2
F77: gfortran   -O2 
FC: gfortran   -O2

Configuration
--with-pm=none --with-pmi=slurm



What I'm doing wrong?


Thanks in advance





 		 	   		  
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20120913/4b9646bc/attachment.html


More information about the mvapich-discuss mailing list