[mvapich-discuss] MPIDI_CH3I_CM_Init: Error initializing MVAPICH2 malloc library

Gilles Civario gilles.civario at ichec.ie
Wed Sep 7 11:01:24 EDT 2011


/* re-send in plain text rather than HTML. Sorry guys */

Hi guys,

I installed mvapich2-1.7-rc1 on one of our clusters to test some features, and with the code I was testing I ran into a problem: while scaling my runs over power-of-2 process counts, the code ran beautifully up to 64 processes, but with 128 every process gave me the following message:

Fatal error in MPI_Init:
Other MPI error, error stack:
MPIR_Init_thread(310)..: Initialization failed
MPID_Init(113).........: channel initialization failed
MPIDI_CH3_Init(161)....:
MPIDI_CH3I_CM_Init(828): Error initializing MVAPICH2 malloc library

After googling and browsing the sources, I suspected some sort of issue with my max locked memory limit, but I managed to rule that out.
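For anyone wanting to repeat the locked-memory check, here is a minimal sketch using Python's standard `resource` module. The helper names `memlock_limits` and `describe` are made up for illustration, and the check only reflects the limit seen by the process that runs it; processes started remotely by a launcher or batch system may get a different limit, so it is best run through the same launch path as the MPI job.

```python
import resource


def memlock_limits():
    """Return the (soft, hard) max locked memory limits, in bytes."""
    return resource.getrlimit(resource.RLIMIT_MEMLOCK)


def describe(value):
    """Render a limit as 'unlimited' or as a whole number of KiB."""
    if value == resource.RLIM_INFINITY:
        return "unlimited"
    return f"{value // 1024} KiB"


if __name__ == "__main__":
    soft, hard = memlock_limits()
    print("max locked memory (soft):", describe(soft))
    print("max locked memory (hard):", describe(hard))
```

A small finite soft limit on the compute nodes would be the thing to look for; in my case the limit turned out not to be the cause.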
Then I suspected an issue with our mpiexec (which is not the one shipped with the library), but I ruled that out as well.
So I tried a more systematic approach: I searched the online documentation for every environment variable that defaults to 128 and set them, one after the other, to 256... Still no joy. Then I did the same with the ones that default to 64, setting them to 128. And then MV2_ON_DEMAND_THRESHOLD did the trick!
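A minimal sketch of that workaround, in case it helps someone else: build an environment with MV2_ON_DEMAND_THRESHOLD set to the number of processes and hand it to the launcher. The helper name `env_with_threshold`, the launcher command line and the `./my_app` binary are placeholders, not anything taken from the library. Choosing a value equal to the process count is an assumption drawn from the single data point above (default 64, and 128 fixed the 128-process run).

```python
import os


def env_with_threshold(nprocs, base_env=None):
    """Return a copy of the environment with MV2_ON_DEMAND_THRESHOLD set.

    Assumption: a threshold equal to the process count is enough, as it
    was for the 128-process run described above.
    """
    if nprocs < 1:
        raise ValueError("nprocs must be a positive integer")
    env = dict(os.environ if base_env is None else base_env)
    env["MV2_ON_DEMAND_THRESHOLD"] = str(nprocs)
    return env


# Hypothetical usage (launcher and application names are placeholders):
#
#   import subprocess
#   subprocess.run(["mpiexec", "-n", "128", "./my_app"],
#                  env=env_with_threshold(128), check=True)
```

Whether the variable actually reaches the remote processes depends on the launcher, so it is worth confirming that from inside the job.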
So my issue is now solved. However, I have a few questions / requests:
    * What is the impact of setting this specific environment variable to a value large enough that I don't experience those crashes in the future?
    * Would it be possible either to document the error message, or to make it more explicit, so that people experiencing the same issue can sort it out more easily than I did?

Thank you for your great work, guys.

Cheers.

Gilles Civario


