[mvapich-discuss] mpi error?
sk
sdk0084 at yahoo.com
Fri May 10 02:44:24 EDT 2013
Hi Devendar,
This is what I've got after following your suggestions:
[sandy2:mpispawn_1][readline] Unexpected End-Of-File on file descriptor 8. MPI process died?
[sandy2:mpispawn_1][mtpmi_processops] Error while reading PMI socket. MPI process died?
[sandy2:mpispawn_1][child_handler] MPI process (rank: 36, pid: 22849) exited with status 152
[sandy1.localdomain:mpispawn_0][read_size] Unexpected End-Of-File on file descriptor 38. MPI process died?
[sandy1.localdomain:mpispawn_0][read_size] Unexpected End-Of-File on file descriptor 38. MPI process died?
[sandy1.localdomain:mpispawn_0][handle_mt_peer] Error while reading PMI socket. MPI process died?
Is it make clear what could be the problem?
Regards,
SK
________________________________
From: Devendar Bureddy <bureddy at cse.ohio-state.edu>
To: sk <sdk0084 at yahoo.com>
Cc: "mvapich-discuss at cse.ohio-state.edu" <mvapich-discuss at cse.ohio-state.edu>
Sent: Wednesday, May 1, 2013 5:05 PM
Subject: Re: [mvapich-discuss] mpi error?
Hi Sk
It is little hard to say what is going wrong with given error message. Can you please add "--enable-g=all --enable-fast=none --enable-error-checking=all" to the configuration and run with MV2_DEBUG_SHOW_BACKTRACE=1 to see if this show any better error message.
Can you also give a try with mpirun_rsh launcher?
-Devendar
On Wed, May 1, 2013 at 3:16 AM, sk <sdk0084 at yahoo.com> wrote:
Hi there,
>My WRF simulation crashed with the following error message:
>= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
>= EXIT CODE: 152
>= CLEANING UP REMAINING PROCESSES
>= YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
>
>
>
>
>some details:
>
>
>mpirun -np 64 -f hosts ./wrf.exe
>
>
>
>./mpich2version
>MVAPICH2 Version: 1.9a2
>MVAPICH2 Release date: Thu Nov 8 11:43:52 EST 2012
>MVAPICH2 Device: ch3:mrail
>MVAPICH2 configure:
--prefix=/mnt/raid5/mvapich2/intel --disable-mcast
>MVAPICH2 CC: icc -DNDEBUG -DNVALGRIND -O2
>MVAPICH2 CXX: icpc -DNDEBUG -DNVALGRIND -O2
>MVAPICH2 F77: ifort -L/lib -L/lib -O2
>MVAPICH2 FC: ifort -O2
>
>
>
>Scientific Linux release 6.3 (Carbon) with infiniband network
>
>Linux sandy1.localdomain 2.6.32-279.el6.x86_64 #1 SMP Thu Jun 21 07:08:44 CDT 2012 x86_64 x86_64 x86_64 GNU/Linux
>
>
>the model doesn't report any error message. what could be the problem?
>
>
>Thanks!SK
>_______________________________________________
>mvapich-discuss mailing list
>mvapich-discuss at cse.ohio-state.edu
>http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
>
--
Devendar
_______________________________________________
mvapich-discuss mailing list
mvapich-discuss at cse.ohio-state.edu
http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20130509/f4f415e0/attachment.html
More information about the mvapich-discuss
mailing list