[mvapich-discuss] [Checkpoint] BLCR call cr_poll_checkpoint() failed with error 2363: Request is invalid across a restart

Александр Твеленев santvel at mail.ru
Sun Jul 17 02:08:27 EDT 2011


Hello group,
I installed BLCR (with configures " --prefix=/home/santvel/BLCR_MVAPICH2/BLCR --enable-static --enable-testsuite") and


MVAPICH2 (with configures "
 --with-rdma=gen2 --enable-romio --with-file-system=lustre+nfs --with-blcr=/home/santvel/BLCR_MVAPICH2/BLCR --with-blcr-include=/home/santvel/BLCR_MVAPICH2/BLCR/include --with-blcr-libpath=/home/santvel/BLCR_MVAPICH2/BLCR/library 
") in the one node of the claster. 

Run my test program. 

/home/santvel/BLCR_MVAPICH2/MVAPICH2/bin/mpirun_rsh -np 2 --hostfile /home/santvel/BLCR_MVAPICH2/MVAPICH2/bin/hosts MV2_CKPT_FILE=/home/santvel/TEST/avt/temp MV2_CKPT_INTERVAL=1 MV2_CKPT_MAX_SAVE_CKPTS=3 /home/santvel/TEST/TEST 
D
uring the execution 
of the program 
was set up 
several auto 
checkpoints. 

After 
the crash, 
I 
tried to restore 
the program 
from one 
of the checkpoints.

cr_restart /home/santvel/TEST/avt/temp.1.auto 
and got an error message
:

[mpirun_ckpt.c:680] BLCR call cr_poll_checkpoint() failed with error 2363: Request is invalid across a restart 
when attempting to restore the program from other 
checkpoints
. 
I got 
an error message 
too.


how to fix 
this error and 
restore 
the program 
after a crash
?


Kind regards, Alexandr.






-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20110717/9b132537/attachment-0001.html


More information about the mvapich-discuss mailing list