[mvapich-discuss] mpi error?

sk sdk0084 at yahoo.com
Fri May 10 02:44:24 EDT 2013


Hi Devendar,
This is what I've got after following your suggestions:

[sandy2:mpispawn_1][readline] Unexpected End-Of-File on file descriptor 8. MPI process died?
[sandy2:mpispawn_1][mtpmi_processops] Error while reading PMI socket. MPI process died?
[sandy2:mpispawn_1][child_handler] MPI process (rank: 36, pid: 22849) exited with status 152
[sandy1.localdomain:mpispawn_0][read_size] Unexpected End-Of-File on file descriptor 38. MPI process died?
[sandy1.localdomain:mpispawn_0][read_size] Unexpected End-Of-File on file descriptor 38. MPI process died?
[sandy1.localdomain:mpispawn_0][handle_mt_peer] Error while reading PMI socket. MPI process died?
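
A note on the status value: by common shell convention, an exit status above 128 means the process was killed by a signal whose number is the status minus 128. The sketch below decodes a status that way. Whether the "status 152" printed by mpispawn follows this convention is an assumption, and `decode_status` is a hypothetical helper name, not part of MVAPICH2.

```shell
#!/bin/sh
# Decode an exit status under the common shell convention
# (assumption: status > 128 means "killed by signal number status-128").
decode_status() {
    status=$1
    if [ "$status" -gt 128 ]; then
        sig=$((status - 128))
        # kill -l N prints the name of signal number N on this platform
        echo "signal $sig ($(kill -l "$sig"))"
    else
        echo "exit code $status"
    fi
}

decode_status 152
```

Under that convention 152 corresponds to signal 24, which on x86_64 Linux is SIGXCPU (CPU time limit exceeded). If that reading applies here, comparing `ulimit -t` on the compute nodes with the run time of wrf.exe may be worth a look.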

Does this make it clear what the problem could be?

Regards,
SK





________________________________
 From: Devendar Bureddy <bureddy at cse.ohio-state.edu>
To: sk <sdk0084 at yahoo.com> 
Cc: "mvapich-discuss at cse.ohio-state.edu" <mvapich-discuss at cse.ohio-state.edu> 
Sent: Wednesday, May 1, 2013 5:05 PM
Subject: Re: [mvapich-discuss] mpi error?
 


Hi Sk

It is a little hard to say what is going wrong from the given error message. Can you please add "--enable-g=all --enable-fast=none --enable-error-checking=all" to the configure options and run with MV2_DEBUG_SHOW_BACKTRACE=1 to see if this shows a better error message?
Can you also try the mpirun_rsh launcher?
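
The two suggestions above might look roughly like the following sketch. The configure prefix, process count, hostfile name, and executable are taken from the original report; the helper function names are hypothetical, and running configure from the top of the MVAPICH2 source tree is an assumption.

```shell
#!/bin/sh
# Hypothetical helpers: defined here, called by hand when needed.

rebuild_mvapich2_debug() {
    # Assumption: run from the top of the MVAPICH2 source tree.
    # Original configure options plus the three debug options.
    ./configure --prefix=/mnt/raid5/mvapich2/intel --disable-mcast \
        --enable-g=all --enable-fast=none --enable-error-checking=all &&
    make &&
    make install
}

run_wrf_with_backtrace() {
    # mpirun_rsh accepts NAME=VALUE environment settings
    # placed before the executable.
    mpirun_rsh -np 64 -hostfile hosts MV2_DEBUG_SHOW_BACKTRACE=1 ./wrf.exe
}
```

Calling `rebuild_mvapich2_debug` once and then `run_wrf_with_backtrace` from the WRF run directory should reproduce the job with backtraces enabled, assuming `hosts` is the same host list used with mpirun.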

-Devendar


On Wed, May 1, 2013 at 3:16 AM, sk <sdk0084 at yahoo.com> wrote:

>Hi there,
>My WRF simulation crashed with the following error message:
>=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
>=   EXIT CODE: 152
>=   CLEANING UP REMAINING PROCESSES
>=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
>
>
>
>
>some details:
>
>
>mpirun -np 64 -f hosts ./wrf.exe
>
>
>
>./mpich2version
>MVAPICH2 Version:         1.9a2
>MVAPICH2 Release date:    Thu Nov  8 11:43:52 EST 2012
>MVAPICH2 Device:          ch3:mrail
>MVAPICH2 configure:       --prefix=/mnt/raid5/mvapich2/intel --disable-mcast
>MVAPICH2 CC:      icc    -DNDEBUG -DNVALGRIND -O2
>MVAPICH2 CXX:     icpc   -DNDEBUG -DNVALGRIND -O2
>MVAPICH2 F77:     ifort -L/lib -L/lib   -O2
>MVAPICH2 FC:      ifort   -O2
>
>
>
>Scientific Linux release 6.3 (Carbon) with an InfiniBand network
>
>Linux sandy1.localdomain 2.6.32-279.el6.x86_64 #1 SMP Thu Jun 21 07:08:44 CDT 2012 x86_64 x86_64 x86_64 GNU/Linux
>
>
>The model doesn't report any error message. What could be the problem?
>
>
>Thanks!
>SK
>_______________________________________________
>mvapich-discuss mailing list
>mvapich-discuss at cse.ohio-state.edu
>http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
>



-- 
Devendar 