[mvapich-discuss] Hard to diagnose errors

Matthew Russell matthew.g.russell at gmail.com
Fri Jul 27 14:28:26 EDT 2012


Hi,

I'm trying to run CMAQ, an air quality model, on a cluster with mvapich
using slurm.  I don't understand this error though:

$ salloc mpirun_rsh -hostfile machines8 -np 2
/home/matt/models/cmaq/trunk/bld/dena-451/scripts/cctm/CCTM_e2a_Linux2_x86_64pgi
salloc: Granted job allocation 85
[cli_1]: readline failed
[cli_1]: readline failed
[cli_0]: readline failed
[cli_0]: readline failed
Fatal error in MPI_Init: Other MPI error, error stack:
MPIR_Init_thread(388)...........:
MPID_Init(125)..................:
MPIDI_Populate_vc_node_ids(1222):
MPID_Get_max_node_id(822).......: PMI_KVS_Put returned -1
salloc: Relinquishing job allocation 85

What does this mean?

The executable was compiled with mpf90 and mpcc, using the mvapich
binaries, etc.

Thanks
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20120727/e69112a4/attachment.html


More information about the mvapich-discuss mailing list