[mvapich-discuss] What's a cause?

Satoshi Isono isono at cray.com
Thu Jun 18 03:47:14 EDT 2009


Hello everyone,

When I used MVAPICH 1.0.1, I got the errors below after two minutes.
The job size is 2,560 MPI processes. I think this problem was caused by
system trouble on the compute nodes, and I would like to hear everyone's
thoughts. The messages seem to show that some shared libraries could not
be loaded. Are there any key hints in the error messages below?

MPI process terminated unexpectedly
MPI process terminated unexpectedly
MPI process terminated unexpectedly
MPI process terminated unexpectedly
MPI process terminated unexpectedly
MPI process terminated unexpectedly
MPI process terminated unexpectedly
forrtl: error (69): process interrupted (SIGINT)
Image              PC                Routine            Line        Source
libpthread.so.0    0000003D3FE0DE60  Unknown               Unknown  Unknown
libpthread.so.0    0000003D3FE0CC79  Unknown               Unknown  Unknown
libibverbs.so.1    0000003D3F606B2F  Unknown               Unknown  Unknown
nhm_driver-2       0000000000CB3642  Unknown               Unknown  Unknown
libpthread.so.0    0000003D3FE062E7  Unknown               Unknown  Unknown
libc.so.6          0000003D3F2CE3BD  Unknown               Unknown  Unknown
forrtl: error (69): process interrupted (SIGINT)
Image              PC                Routine            Line        Source
nhm_driver-2       000000000068AC82  Unknown               Unknown  Unknown
nhm_driver-2       00000000004059D6  Unknown               Unknown  Unknown
nhm_driver-2       0000000000405942  Unknown               Unknown  Unknown
libc.so.6          0000003D3F21D8A4  Unknown               Unknown  Unknown
nhm_driver-2       0000000000405869  Unknown               Unknown  Unknown
MPI process terminated unexpectedly
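
To make traces like the ones above easier to compare across ranks, the
frames can be grouped by image. The following is only a minimal sketch
in Python, not part of the original job output: the names FRAME and
count_frames_by_image are hypothetical, and it assumes each frame line
starts with an image name followed by a 16-digit hexadecimal PC, as in
the forrtl output shown here.

```python
import re
from collections import Counter

# Assumption: a frame line begins with an image name, then a 16-digit hex PC.
FRAME = re.compile(r"^(\S+)\s+([0-9A-Fa-f]{16})\b")


def count_frames_by_image(log_text):
    """Return a Counter mapping image name to number of traceback frames."""
    counts = Counter()
    for line in log_text.splitlines():
        match = FRAME.match(line.strip())
        if match:
            counts[match.group(1)] += 1
    return counts


# A few frame lines copied from the first traceback in this message.
sample = """\
forrtl: error (69): process interrupted (SIGINT)
Image              PC                Routine            Line
libpthread.so.0    0000003D3FE0DE60  Unknown               Unknown
libibverbs.so.1    0000003D3F606B2F  Unknown               Unknown
nhm_driver-2       0000000000CB3642  Unknown               Unknown
"""

if __name__ == "__main__":
    for image, n in count_frames_by_image(sample).most_common():
        print(image, n)
```

Header lines and wrapped "Unknown" continuation lines do not match the
pattern, so they are ignored. The tally only shows where the processes
were when SIGINT arrived; it does not by itself identify the cause.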


Regards,
Satoshi Isono



