[mvapich-discuss] mvapich 1.1 mpirun_rsh problems on process count > 350

Johnny Devaprasad johnnydevaprasad at gmail.com
Mon Mar 28 05:35:53 EDT 2011


Hi all,

mpirun_rsh has problems when launching jobs with np > 350.
The resulting error is as follows:

PMGR_COLLECTIVE ERROR: unexpected value: received 1, expecting 7 @ file
pmgr_collective_mpispawn.c:144

Any suggestions, about how to fix this would be greatly appreciated.
I tried increasing the limits by VIADEV ON DEMAND THRESHOLD , but
does not seem to help. I could be tuning the wrong variable.

The IB stack that is being used is the default redhat 5 infiniband stack.
I do not have information about the IB adapters, but if that is contributing
to this
error, then please do let me know.

Thank you in advance.

Regards,
Johnny
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20110328/f55f3866/attachment.html


More information about the mvapich-discuss mailing list