[mvapich-discuss] [Checkpoint] BLCR call cr_poll_checkpoint()
failed with error 2363: Request is invalid across a restart
Александр Твеленев
santvel at mail.ru
Sun Jul 17 02:08:27 EDT 2011
Hello group,
I installed BLCR (with configures " --prefix=/home/santvel/BLCR_MVAPICH2/BLCR --enable-static --enable-testsuite") and
MVAPICH2 (with configures "
--with-rdma=gen2 --enable-romio --with-file-system=lustre+nfs --with-blcr=/home/santvel/BLCR_MVAPICH2/BLCR --with-blcr-include=/home/santvel/BLCR_MVAPICH2/BLCR/include --with-blcr-libpath=/home/santvel/BLCR_MVAPICH2/BLCR/library
") in the one node of the claster.
Run my test program.
/home/santvel/BLCR_MVAPICH2/MVAPICH2/bin/mpirun_rsh -np 2 --hostfile /home/santvel/BLCR_MVAPICH2/MVAPICH2/bin/hosts MV2_CKPT_FILE=/home/santvel/TEST/avt/temp MV2_CKPT_INTERVAL=1 MV2_CKPT_MAX_SAVE_CKPTS=3 /home/santvel/TEST/TEST
D
uring the execution
of the program
was set up
several auto
checkpoints.
After
the crash,
I
tried to restore
the program
from one
of the checkpoints.
cr_restart /home/santvel/TEST/avt/temp.1.auto
and got an error message
:
[mpirun_ckpt.c:680] BLCR call cr_poll_checkpoint() failed with error 2363: Request is invalid across a restart
when attempting to restore the program from other
checkpoints
.
I got
an error message
too.
how to fix
this error and
restore
the program
after a crash
?
Kind regards, Alexandr.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20110717/9b132537/attachment-0001.html
More information about the mvapich-discuss
mailing list