[mvapich-discuss] cm_enable_qp_init_to_rtr error

Sayantan Sur surs at cse.ohio-state.edu
Fri Mar 26 17:42:44 EDT 2010


Hi Steve,

Thanks for your report. This may be related to the on-demand
connection manager in MVAPICH. It could also be some weird IB stack
issue where connection creation fails sometimes. It will be hard to
say which way without more details about the system and the workload
you are running:

1) Which version of OFED are you running? what is your platform?
2) At how many processes do you see this failure?
3) Can the application code be tried out by others to see if they
reproduce this error?
4) Do you see this failure with any other compilers, or is it specific to icc?

Hopefully, with this information we will better understand your problem.

Thanks.

On Fri, Mar 26, 2010 at 11:55 AM, Repsher, Stephen J
<stephen.j.repsher at boeing.com> wrote:
> Hello,
>
> I'm experiencing some random hanging behavior with my application compiled with MVAPICH 1.1 and the Intel 11.1 compiler.  Most of the time there are no errors and the code hangs, but once in a while I get an error like this...
>
> [Rank 33][cm.c: line 398]Failed to modify QP to RTR
> [Rank 33][cm.c: line 582]cm_enable_qp_init_to_rtr failed
>
> Anyone have an idea what this might be related to?
>
> Thanks for your help.
>
> ============================================
> Steve Repsher
> Boeing Defense, Space, & Security - Rotorcraft
> Aerodynamics/CFD
> Phone: (610) 591-1510
> Fax: (610) 591-6263
> stephen.j.repsher at boeing.com
>
>
>
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>
>



-- 
Sayantan Sur

Research Scientist
Department of Computer Science
The Ohio State University.



More information about the mvapich-discuss mailing list