[mvapich-discuss] handling of RDMA_CM_EVENT_REJECTED event in MVAPICH2-1.2p1

gossips J polk678 at gmail.com
Thu Aug 13 00:27:36 EDT 2009


Hmm got the clarification.
Thanks a lot.

On Wed, Aug 12, 2009 at 5:35 PM, Hari Subramoni <subramon at cse.ohio-state.edu
> wrote:

> Hi Polk,
>
> I'm sorry about saying that the job will be aborted. It will not get
> aborted when we get a RDMA_CM_EVENT_REJECTED. My bad.
>
> Rejecting the connection is done with the knowledge of the src/dest
> process and is not due to some fault in the system.
>
> We reject the connection only when the server and the client try to make a
> connection to each other at the same time. In this secnario, the
> connection request from the server process is rejected by the client.
>
> Hope this clarifies.
>
> Thx,
> Hari.
>
> On Wed, 12 Aug 2009, gossips J wrote:
>
> > Hi Hari,
> > thanks for the response.
> >
> > Yes, this event is considered as CONNECT_ERROR and task aborts but REJECT
> > has no handling so it does nothing.
> >
> > Two question:
> > 1. Why should task gets aborted on Connect Error event. As i can predict
> the
> > side effect would be affecting other connections with same src-dst pair
> in
> > data transfer operation.
> >
> > 2. What happens in case of Reject event? Mvapich simply has DEBUG_PRINT
> and
> > break out of switch case.
> >
> > Thanks,
> > Polk.
> >
> > On Tue, Aug 11, 2009 at 6:33 PM, Hari Subramoni <
> subramon at cse.ohio-state.edu
> > > wrote:
> >
> > > Hi Polk,
> > >
> > > The RDMA_CM_EVENT_REJECTED event is being handled inside
> mvapich2-1.2p1.
> > > On receiving this event, the task will abort.
> > >
> > > This is done in 'src/mpid/ch3/channels/mrail/src/gen2/rdma_cm.c' in the
> > > function 'ib_cma_event_handler'.
> > >
> > > Please let us know if you have any further questions.
> > >
> > > Thx,
> > > Hari.
> > >
> > > On Tue, 11 Aug 2009, gossips J wrote:
> > >
> > > > Hi,
> > > >
> > > > It looks like there is no handling for RDMA_CM_EVENT_REJECTED event
> in ""
> > > > file.
> > > >
> > > > There has to be some sort of handling, isnt it?
> > > >
> > > > Is this feature (CM_REJECT) supported in mvapich2-1.2p1 over OFA???
> > > >
> > > > In current scenario mvapich2 simply stuck during such event since
> there
> > > is
> > > > no handling in MPI (mvapich2).
> > > >
> > > > Thoughts???
> > > >
> > > > Thanks,
> > > > Polk.
> > > >
> > >
> > >
> >
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20090813/88261c67/attachment.html


More information about the mvapich-discuss mailing list