[mvapich-discuss] Error - vbuf not correct

Hari Subramoni subramoni.1 at osu.edu
Fri Mar 20 13:36:43 EDT 2015


Hello,

Could you please clarify which version of MVAPICH you are using and the
build options used. Output of mpiname -a will help.

On a different note, I see that you are using nemesis. For best
performance, we recommend that you use the support for OpenFabrics (OFA)
IB/iWARP/RoCE available with the CH3 channel

Please refer to the following section of the userguide for more information
on how to configure MVAPICH2 to use the CH3 channel.

http://mvapich.cse.ohio-state.edu/static/media/mvapich/mvapich2-2.1rc2-userguide.html#x1-110004.4

Thx,
Hari.

On Fri, Mar 20, 2015 at 1:29 PM, Liu Jianyu <jerry_leo at msn.com> wrote:

> Hi,
>
> Recently WRF V3.6.1 aborted with these error messages on OFA
>
>    recv desc error, 10934
>    recv desc error, 10934
>    [5] Abort: vbuf not correct.
>    at line 410 in file src/mpid/ch3/channels/nemesis/netmod/ib/ib_vbuf.c
>
> Tried run WRF on TCP/IP with the same nodes like this without any problems
>
>   MPICH_NEMESIS_NETMOD=tcp  mpirun -np 64 -ppn 8 -hostfile n064 ./wrf.exe
>
> Wondering it may be hardware issue of IB.   But no idea how to identify
> the problem node.
>
> Any comments ?
>
> Appreciating your kindly help
>
> Regards
>
> Jianyu
>
>
>
>
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mailman.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20150320/8be5cda7/attachment.html>


More information about the mvapich-discuss mailing list