[mvapich-discuss] [rsh] <defunct> ????

Michael Li mli at deform.com
Wed Mar 8 15:31:14 EST 2006


Hi, all

I have a 8-node cluster, when I start my application, I saw
many processes "[rsh] <defunct>", is this normal or I did
something wrong ?

I am using mvapich-0.9.5-119 and Mellanox IB Gold Distribution (IBGD) 
v1.7.0 for Linux.

  deform at sftc001:/home/deform/infiniband> uname -a
Linux sftc001 2.6.10-suse92-i4smp #62 SMP Thu Mar 31 12:03:47 EST 2005 
i686 i686 i386 GNU/Linux
deform at sftc001:/home/deform/infiniband> cat /etc/issue

Welcome to SuSE Linux 9.2 (i586) - Kernel \r (\l).



mli at sftc001:/home/mli/tmp/fem3d> ps -ef | grep mli1
root     21007  4814  0 09:24 ?        00:00:00 sshd: mli1 [priv]
mli1     21010 21007  0 09:24 ?        00:00:00 sshd: mli1 at pts/17
mli1     21011 21010  0 09:24 pts/17   00:00:00 -tcsh
root     21576  4814  0 09:28 ?        00:00:00 sshd: mli1 [priv]
mli1     21579 21576  0 09:28 ?        00:00:00 sshd: mli1 at pts/18
mli1     21580 21579  0 09:28 pts/18   00:00:00 -tcsh
root     20058  4814  0 13:27 ?        00:00:00 sshd: mli1 [priv]
mli1     20061 20058  0 13:27 ?        00:00:00 sshd: mli1 at pts/22
mli1     20062 20061  0 13:27 pts/22   00:00:00 -tcsh
mli1     21234     1  0 13:31 ?        00:00:00 xterm
mli1     21236 21234  0 13:31 pts/20   00:00:00 -csh
mli1     30517 21011  0 15:28 pts/17   00:00:00 /bin/csh -f 
/home/mli1/ravi/image/COM/DEF_ARM.COM ConePreform_360deg_DB3_Take2 B
mli1     30603 30517  0 15:28 pts/17   00:00:00 /bin/sh 
/home/mli1/ravi/image/COM/DEF_SIM_CTL.COM >ConePreform_360deg_DB3_Take2.MSG
mli1     30649 30603  0 15:28 pts/17   00:00:00 /bin/sh 
/home/mli1/ravi/image/mvapich/bin/mpirun -np 8 -hostfile 
/home/mli1/ravi/image/mvapich/share/machines/hosts.list 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30677 30649  0 15:28 pts/17   00:00:00 
/home/mli1/ravi/image/mvapich/bin/mpirun_rsh -rsh -np 8 -hostfile 
/home/mli1/ravi/image/mvapich/share/machines/hosts.list 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30678 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc001 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=0 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30679 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc002 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=1 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30680 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc003 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=2 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30681 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc004 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=3 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30683 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc005 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=4 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30684 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc006 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=5 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30685 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc007 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=6 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30686 30677  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc008 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=7 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30687 30682  0 15:28 ?        00:00:00 tcsh -c cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=0 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30690 30680  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30692 30678  0 15:28 pts/17   00:00:00 /usr/bin/rsh sftc001 cd 
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001 
MPIRUN_PORT=52936 
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:' 
MPIRUN_RANK=0 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE 
 >ConePreform_360deg_DB3_Take2.MSG
mli1     30697 30681  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30702 30685  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30703 30679  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30704 30686  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30706 30684  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30711 30683  0 15:28 pts/17   00:00:00 [rsh] <defunct>
mli1     30712 30687 96 15:28 ?        00:00:48 
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
mli      30894 17880  0 15:29 pts/16   00:00:00 grep mli1



==========
This email message and any attachments are for the sole use of the intended recipients and may contain proprietary and/or confidential information which may be privileged or otherwise protected from disclosure. Any unauthorized review, use, disclosure or distribution is prohibited. If you are not the intended recipients, please contact the sender by reply email and destroy the original message and any copies of the message as well as any attachments to the original message.



More information about the mvapich-discuss mailing list