[mvapich-discuss] Can you help me ?

sunway qilu sunwaycn at gmail.com
Mon May 14 21:32:50 EDT 2007


Hello,
I have the same problem with the blcr and mvapich2-0.98 ,following is  error
which I have:

[fortest at fes01 ~]$ mpdboot -n 2
[fortest at fes01 ~]$ ps ax|grep mpd
13976 ?        S      0:00 python2.3 /usr/local/mvapich2p2/bin/mpd.py
--ncpus=1 -e -d
13977 pts/16   S      0:00 ssh -x -n gn01 /usr/local/mvapich2p2/bin/mpd.py
-h fes01 -p 46910  --ncpus=1 -e -d
14003 pts/16   S+     0:00 grep mpd

[fortest at fes01 ~]$ mpiexec -n 2 ./cpi
[Rank 0][cr.c: line 124]connect 24678 failed
rank 0 in job 1  fes01_46910   caused collective abort of all ranks
  exit status of rank 0: killed by signal 9


following is the config file  of the user fortest:

[fortest at fes01 ~]$ cat .bashrc
# .bashrc

# User specific aliases and functions

# Source global definitions
if [ -f /etc/bashrc ]; then
        . /etc/bashrc
fi
export MPI_ROOT=/usr/local/mvapich2p2
export PATH=$MPI_ROOT/bin:$PATH
export MV2_CKPT_FILE=/home/fortest/ckptfile
export MV2_CKPT_INTERVAL=20
export MV2_CKPT_MAX_SAVE_CKPTS=3
export MV2_CKPT_MPD_BASE_PORT=24678
export MV2_CKPT_MPIEXEC_PORT=14678
export VIADEV_DEFAULT_TIME_OUT=16


[fortest at fes01 ~]$ cat mpd.hosts
gn01
gn02
gn03
gn04
gn05
gn06
gn07
gn08

I wait for your answers and thanks a lot.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20070515/3b7485a0/attachment.html


More information about the mvapich-discuss mailing list