[mvapich-discuss] MVAPICH Error

OShea, Thomas T. THOMAS.T.O'SHEA at saic.com
Fri Aug 3 18:33:20 EDT 2007


Hello again,

Thanks for all your help in the past; I've been able to get my code up
and running on a small 32-processor cluster. I'm doing scaling tests: I
ran with an array size of 16x16x16 on 1, 2, 4, 8, and 16 processors and
saw fairly good scaling. When I increased the array size to 32x32x32, my
code ran fine for all but the 8-processor case. The odd part is that it
doesn't crash until the 15th iteration, and I'm doing 21 iterations for
each case. Here is the error it produces:

ch3_rndvtransfer.c:614: MPIDI_CH3_Get_rndv_push: Assertion '(get_resp_pkt->seqnum) + 1 == (vc)->seqnum_send' failed.

I imagine this will be a pain for me to debug, since it takes about 30
minutes to reach the point where it fails. Have you seen this error
before, or do you have any idea what might be causing it? Any tips would
be greatly appreciated.

Thanks,

Thomas O'Shea
