[mvapich-discuss] mvapich jobs cleanup

Mark Potts potts at hpcapplications.com
Wed Jun 13 09:47:52 EDT 2007


Hi,
    We are observing a number of cases in which MVAPICH-0.9.9
    jobs launched with mpirun_rsh leave stray processes on some
    nodes when the job terminates abnormally.  Those stray
    processes keep running indefinitely and have to be
    identified and killed manually.

    Is there a reason this happens with MVAPICH, and is there a
    way to prevent it?  This does not seem to happen when
    Voltaire MPI or Intel MPI jobs terminate abnormally.
          regards,
-- 
***********************************
 >> Mark J. Potts, PhD
 >>
 >> HPC Applications Inc.
 >> phone: 410-992-8360 Bus
 >>        410-313-9318 Home
 >>        443-418-4375 Cell
 >> email: potts at hpcapplications.com
 >>        potts at excray.com
***********************************

