[mvapich-discuss] process limits
amith rajith mamidala
mamidala at cse.ohio-state.edu
Tue Aug 28 12:30:50 EDT 2007
Hi Mark,
Can you check if you get this error by setting the environment variable:
VIADEV_USE_SHMEM_COLL to 0 e.g. mpirun_rsh -np N VIADEV_USE_SHMEM_COLL=0
./a.out
-thanks,
Amith
On Tue, 28 Aug 2007, Mark Potts wrote:
> Hi,
> Is there an effective or hard limit on the number of MVAPICH
> processes that can be run on a single node?
>
> Given N cpus, each having M cores, on a single node, I've been told
> that one can not run more than N*M MVAPICH processes on a single
> node. In fact, I observe that if I try to even approach this number
> with "-np 16" (for a node with N=8 and M=4), I observe a "unable to
> find child nnnn!" or "Child died" message. Is this a configuration
> problem with this system or somehow an expected behavior?
>
> More pointedly, should oversubscription of cores, np > N*M, on a
> single node work in MVAPICH? How about in MVAPICH2?
>
> regards,
> --
> ***********************************
> >> Mark J. Potts, PhD
> >>
> >> HPC Applications Inc.
> >> phone: 410-992-8360 Bus
> >> 410-313-9318 Home
> >> 443-418-4375 Cell
> >> email: potts at hpcapplications.com
> >> potts at excray.com
> ***********************************
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
More information about the mvapich-discuss
mailing list