[mvapich-discuss] [rsh] <defunct> ????
Michael Li
mli at deform.com
Wed Mar 8 15:31:14 EST 2006
Hi all,
I have an 8-node cluster. When I start my application, I see
many "[rsh] <defunct>" processes. Is this normal, or did I do
something wrong?
I am using mvapich-0.9.5-119 and Mellanox IB Gold Distribution (IBGD)
v1.7.0 for Linux.
deform@sftc001:/home/deform/infiniband> uname -a
Linux sftc001 2.6.10-suse92-i4smp #62 SMP Thu Mar 31 12:03:47 EST 2005
i686 i686 i386 GNU/Linux
deform@sftc001:/home/deform/infiniband> cat /etc/issue
Welcome to SuSE Linux 9.2 (i586) - Kernel \r (\l).
mli@sftc001:/home/mli/tmp/fem3d> ps -ef | grep mli1
root 21007 4814 0 09:24 ? 00:00:00 sshd: mli1 [priv]
mli1 21010 21007 0 09:24 ? 00:00:00 sshd: mli1@pts/17
mli1 21011 21010 0 09:24 pts/17 00:00:00 -tcsh
root 21576 4814 0 09:28 ? 00:00:00 sshd: mli1 [priv]
mli1 21579 21576 0 09:28 ? 00:00:00 sshd: mli1@pts/18
mli1 21580 21579 0 09:28 pts/18 00:00:00 -tcsh
root 20058 4814 0 13:27 ? 00:00:00 sshd: mli1 [priv]
mli1 20061 20058 0 13:27 ? 00:00:00 sshd: mli1@pts/22
mli1 20062 20061 0 13:27 pts/22 00:00:00 -tcsh
mli1 21234 1 0 13:31 ? 00:00:00 xterm
mli1 21236 21234 0 13:31 pts/20 00:00:00 -csh
mli1 30517 21011 0 15:28 pts/17 00:00:00 /bin/csh -f
/home/mli1/ravi/image/COM/DEF_ARM.COM ConePreform_360deg_DB3_Take2 B
mli1 30603 30517 0 15:28 pts/17 00:00:00 /bin/sh
/home/mli1/ravi/image/COM/DEF_SIM_CTL.COM >ConePreform_360deg_DB3_Take2.MSG
mli1 30649 30603 0 15:28 pts/17 00:00:00 /bin/sh
/home/mli1/ravi/image/mvapich/bin/mpirun -np 8 -hostfile
/home/mli1/ravi/image/mvapich/share/machines/hosts.list
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30677 30649 0 15:28 pts/17 00:00:00
/home/mli1/ravi/image/mvapich/bin/mpirun_rsh -rsh -np 8 -hostfile
/home/mli1/ravi/image/mvapich/share/machines/hosts.list
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30678 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc001 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=0 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30679 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc002 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=1 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30680 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc003 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=2 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30681 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc004 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=3 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30683 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc005 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=4 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30684 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc006 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=5 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30685 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc007 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=6 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30686 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc008 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=7 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30687 30682 0 15:28 ? 00:00:00 tcsh -c cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=0 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30690 30680 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30692 30678 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc001 cd
/home/mli1/mike; /usr/bin/env MPIRUN_MPD=0 MPIRUN_HOST=sftc001
MPIRUN_PORT=52936
MPIRUN_PROCESSES='sftc001:sftc002:sftc003:sftc004:sftc005:sftc006:sftc007:sftc008:'
MPIRUN_RANK=0 MPIRUN_NPROCS=8 MPIRUN_ID=30677 DISPLAY=michael:0.0
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
>ConePreform_360deg_DB3_Take2.MSG
mli1 30697 30681 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30702 30685 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30703 30679 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30704 30686 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30706 30684 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30711 30683 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30712 30687 96 15:28 ? 00:00:48
/home/mli1/ravi/image/EXE/DEF_SIM_P4_INFINIBAND.EXE
mli 30894 17880 0 15:29 pts/16 00:00:00 grep mli1
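In the listing above, each "[rsh] <defunct>" entry has one of the /usr/bin/rsh processes started by mpirun_rsh as its parent (for example, PID 30690 is a child of PID 30680, the rsh to sftc003). For anyone who wants to repeat this check on their own cluster, here is a minimal Python sketch that pulls the defunct entries and their parent PIDs out of `ps -ef` output. It is not part of mvapich; the function name `find_defunct` and the `SAMPLE` text are illustrative, and the column layout (UID PID PPID C STIME TTY TIME CMD) is assumed to match the output shown here.

```python
def find_defunct(ps_output):
    """Return (pid, ppid) pairs for `ps -ef` lines marked <defunct>."""
    zombies = []
    for line in ps_output.splitlines():
        if "<defunct>" not in line:
            continue
        fields = line.split()
        # Assumed ps -ef columns: UID PID PPID C STIME TTY TIME CMD
        if len(fields) < 8:
            continue
        if not (fields[1].isdigit() and fields[2].isdigit()):
            continue
        zombies.append((int(fields[1]), int(fields[2])))
    return zombies


# A few lines copied from the listing above, used as sample input.
SAMPLE = """\
mli1 30680 30677 0 15:28 pts/17 00:00:00 /usr/bin/rsh sftc003 cd
mli1 30690 30680 0 15:28 pts/17 00:00:00 [rsh] <defunct>
mli1 30697 30681 0 15:28 pts/17 00:00:00 [rsh] <defunct>
"""

for pid, ppid in find_defunct(SAMPLE):
    print(f"defunct pid {pid} has parent pid {ppid}")
```

To run it against a live system, the text returned by `ps -ef` can be passed to `find_defunct` in place of `SAMPLE`.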