[mvapich-discuss] Registration cache problem

Subramoni, Hari subramoni.1 at osu.edu
Thu Jul 2 14:20:17 EDT 2020


Hi, Alexander.

We looked at the issue. Your patch does solve the problem. However, we thought the attached patch may be a safer way. We tried it locally and the application seems to run fine. Can you please try it out and see if it solves your issue?

Best,
Hari.

From: alex.i.melnikov at gmail.com <alex.i.melnikov at gmail.com>
Sent: Monday, June 29, 2020 10:36 AM
To: mvapich-discuss at cse.ohio-state.edu <mvapich-discuss at mailman.cse.ohio-state.edu>
Cc: Subramoni, Hari <subramoni.1 at osu.edu>
Subject: Registration cache problem

Perhaps the problem is with the delayed registration. A small patch (in the attachment) solved the problem for me, but I'm not sure of its correctness.
Best regards, Alexander Melnikov.
From: Subramoni, Hari<mailto:subramoni.1 at osu.edu>
Sent: Monday, June 29, 2020 4:46 PM
To: Alexander Melnikov<mailto:alex.i.melnikov at gmail.com> ; mailto:mvapich-discuss at mailman.cse.ohio-state.edu
Cc: Subramoni, Hari<mailto:subramoni.1 at osu.edu>
Subject: RE: [mvapich-discuss] Registration cache problem

Hi, Alexander.

Thanks for reporting the issue. We appreciate it.

We are able to reproduce it. We are looking into it and will get back to you.

Best,
Hari.

From: mvapich-discuss-bounces at cse.ohio-state.edu<mailto:mvapich-discuss-bounces at cse.ohio-state.edu> <mvapich-discuss-bounces at mailman.cse.ohio-state.edu<mailto:mvapich-discuss-bounces at mailman.cse.ohio-state.edu>> On Behalf Of Alexander Melnikov
Sent: Monday, June 29, 2020 1:07 AM
To: mvapich-discuss at cse.ohio-state.edu<mailto:mvapich-discuss at cse.ohio-state.edu> <mvapich-discuss at mailman.cse.ohio-state.edu<mailto:mvapich-discuss at mailman.cse.ohio-state.edu>>
Subject: [mvapich-discuss] Registration cache problem

Hey. Can anyone check out a little test (in the attachment)? On each node, one MPI process starts, a small number of nodes is enough (3-6). We used the MVAPICH2 library versions 2.3.3 and 2.3.4, the compiler - GCC 4.8.5.
Almost all tests fail (bad data in rcv_thread). However, when the registration cache is disabled (MV2_USE_LAZY_MEM_UNREGISTER = 0), the error disappears.


________________________________
[Avast logo]<https://urldefense.com/v3/__https:/www.avast.com/antivirus__;!!KGKeukY!haenv9wShyyLNADsgxEeUuE3Fgde1jCoJR_kolJIMLbcwE_H614A43EBreqORyR-WQ$>

Это сообщение проверено на вирусы антивирусом Avast.
www.avast.com<https://urldefense.com/v3/__https:/www.avast.com/antivirus__;!!KGKeukY!haenv9wShyyLNADsgxEeUuE3Fgde1jCoJR_kolJIMLbcwE_H614A43EBreqORyR-WQ$>


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20200702/9a62cfdb/attachment.html>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: multi-threaded-dreg-patch.txt
URL: <http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20200702/9a62cfdb/attachment.txt>


More information about the mvapich-discuss mailing list