[Mvapich-discuss] Crash in mvapich-4h.1

Alex mgs.rus.52 at gmail.com
Wed Jan 7 08:53:01 EST 2026


Hi,

I think there is a bug in MPIDI_IPC_rndv_cts_cb (ipc_control.c). When
handling the rendezvous CTS, the code uses sreq->status.MPI_SOURCE as the
destination (later, in MPIDI_IPC_remote_complete), but I'm afraid this
field might not be filled in properly for 'sending' requests.
As a result, MPI crashes if MPIR_CVAR_CH4_IPC_RNDV_PROTOCOL is set to
auto  (message_size > MPIR_CVAR_CH4_IPC_CMA_P2P_THRESHOLD)  | coop | rput
(with number of ranks > 2).

I fixed it locally by propagating the correct value to MPI_SOURCE, inserting
        sreq->status.MPI_SOURCE = MPIDIG_REQUEST(sreq, u.send.dest);
before
        MPIDI_IPC_remote_complete(sreq, hdr->ipc_hdr.ipc_type);
around ipc_control.c:139, but you can choose a more 'elegant' way to fix
this.
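
To illustrate the failure mode outside of the library, here is a minimal
standalone sketch. It is not MVAPICH code: every type and function below
(mock_request, mock_rndv_cts_cb, MOCK_UNSET_RANK, and so on) is a
hypothetical stand-in, and the assumption that an unfilled MPI_SOURCE holds
-1 is mine, made only so the effect is visible. It just models a send
request whose status.MPI_SOURCE was never set and shows how copying the
stored destination into it changes the rank that the completion step sees.

```c
#include <assert.h>

/* Hypothetical marker for "field was never filled in" (an assumption). */
#define MOCK_UNSET_RANK (-1)

/* Simplified stand-ins; the real request structure is far larger. */
typedef struct {
    int MPI_SOURCE;   /* normally meaningful on receive requests */
} mock_status;

typedef struct {
    mock_status status;
    int send_dest;    /* stands in for MPIDIG_REQUEST(sreq, u.send.dest) */
} mock_request;

/* Build a send request the way the report suspects it looks:
 * the destination is known, but status.MPI_SOURCE is left unset. */
static mock_request mock_make_send_request(int dest)
{
    mock_request r;
    assert(dest >= 0);
    r.status.MPI_SOURCE = MOCK_UNSET_RANK;
    r.send_dest = dest;
    return r;
}

/* Stand-in for MPIDI_IPC_remote_complete: it takes the peer rank from
 * status.MPI_SOURCE and returns it so the caller can inspect it. */
static int mock_remote_complete_target(const mock_request *sreq)
{
    return sreq->status.MPI_SOURCE;
}

/* Stand-in for the CTS callback. apply_fix toggles the one-line
 * workaround described above. */
static int mock_rndv_cts_cb(mock_request *sreq, int apply_fix)
{
    if (apply_fix)
        sreq->status.MPI_SOURCE = sreq->send_dest;
    return mock_remote_complete_target(sreq);
}
```

Calling mock_rndv_cts_cb on a request made for destination 2 yields
MOCK_UNSET_RANK without the workaround and 2 with it, which is the whole
point of the one-line change.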

Cheers,
Alex