[mvapich-discuss] Help! Problems Slurm and MVAPICH2
José Manuel Molero
jmlero at hotmail.com
Thu Sep 13 03:58:59 EDT 2012
Hello,
We have a new cluster with an Infiniband network, and I think that Slurm and MVAPICH2 would be the best option in this case.
I have configured SLURM 2.3.2 on Ubuntu Server and its works.
Now I tried to install MVAPICH2 1.8, with the following:
./configure --with-pm=none --with-pmi=slurm ; make ; make install (in the front end and all the compute nodes)
But it dosent work.
I compile using :
mpicc file.c -o file -lpmi -L/usr/include/slurm/
and then:
srun -N2 file
And the result is:
In: PMI_Abort(1, Fatal error in MPI_Init:
Other MPI error
)
In: PMI_Abort(1, Fatal error in MPI_Init:
Other MPI error
)
srun: Job step aborted: Waiting up to 2 seconds for job step to finish.
srun: error: node17: task 1: Exited with exit code 1
slurmd[node16]: *** STEP 102.0 KILLED AT 2012-09-13T09:56:18 WITH SIGNAL 9 ***
srun: error: node16: task 0: Exited with exit code 1
slurmd[node16]: *** STEP 102.0 KILLED AT 2012-09-13T09:56:18 WITH SIGNAL 9 ***
And the output of mpiname -a
MVAPICH2 1.8 Mon Apr 30 14:56:40 EDT 2012 ch3:mrail
Compilation
CC: gcc -DNDEBUG -DNVALGRIND -O2
CXX: c++ -DNDEBUG -DNVALGRIND -O2
F77: gfortran -O2
FC: gfortran -O2
Configuration
--with-pm=none --with-pmi=slurm
What I'm doing wrong?
Thanks in advance
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20120913/4b9646bc/attachment.html
More information about the mvapich-discuss
mailing list