[mvapich-discuss] messege truncated

nilesh awate nilesh_awate at yahoo.com
Fri Nov 21 08:23:07 EST 2008



Hi Justine,

We are running Pallas over mpi( dapl interconnect), I got the same error while running Pallas with tcp-ip(ethernet) network.

Fatal error in MPI_Recv:
Message truncated, error stack:
MPI_Recv(186)..........................: MPI_Recv(buf=0x7fff23cdd22c, count=976479459, MPI_INT, src=2, tag=1000,MPI_COMM_WORLD, status=0x7fff23cdd210) failed
MPIDI_CH3U_Post_data_receive_found(163): Message from rank 2 and tag 1000 truncated; 4 bytes received but buffersize is -389049460

I am running it over AMD 5 nodes cluster having this (1Ghz Dual-Core AMD Opteron Processor 1216) configuration.

I don't know how MPI_Recv got such a huge count. . .when Pallas is sending max 4194304Bytes

is this some garbage value it receives ?

waiting for reply,

Nilesh


   
 







________________________________
From: Justin <luitjens at cs.utah.edu>
To: nilesh awate <nilesh_awate at yahoo.com>
Cc: Dhabaleswar Panda <panda at cse.ohio-state.edu>; MVAPICH2 <mvapich-discuss at cse.ohio-state.edu>
Sent: Thursday, 20 November, 2008 9:27:42 PM
Subject: Re: [mvapich-discuss] messege truncated

The message means mpi received a message larger than the buffer size you specified.  Namely in this case the buffer length is '-514665432'  thus any length of message would be bigger than it.  What I find odd is the parameters you are sending MPI_Recv.  You are sending a count of '945075466'  are you really sending a message that is a gigabyte in size?  It might be possible that the count is being converted to a signed int causing it to wrap to a negative number.  Check the size that you are specifying for the buffer.  It is odd that you have it specified to be a GB in size when you are only receiving 2 bytes.
nilesh awate wrote:
> 
> Thanks for suggestion (use mvapich2-1.2) sir,
> 
> I have tried the same but still we are facing same problem
> 
> Fatal error in MPI_Recv:
> Message truncated, error stack:
> MPI_Recv(186)........................: MPI_Recv(buf=0x7fff1faf6008, count=945075466, MPI_INT, src=2, tag=1000, MPI_COMM_WORLD, status=0x7fff1faf5fe0) failed
> MPIDI_CH3U_Request_unpack_uebuf(590): Message truncated; 4 bytes received but buffer size is -514665432
> rank 0 in job 4  test01_52519   caused collective abort of all ranks
> exit status of rank 0: killed by signal 9
> 
> is there any suggestion ?
> 
> what does this error mean mean ?
> 
> is this a result of data curruption/packet missing, or something else ?
> 
> wating for reply
> Nilesh Awate
> 
> 
> 
> ------------------------------------------------------------------------
> *From:* Dhabaleswar Panda <panda at cse.ohio-state.edu>
> *To:* nilesh awate <nilesh_awate at yahoo.com>
> *Cc:* MVAPICH2 <mvapich-discuss at cse.ohio-state.edu>
> *Sent:* Wednesday, 19 November, 2008 9:27:36 PM
> *Subject:* Re: [mvapich-discuss] messege truncated
> 
> MVAPICH2 1.2 was released around two weeks back. Can you try the latest
> version.
> 
> DK
> 
> On Wed, 19 Nov 2008, nilesh awate wrote:
> 
> > Hi all,
> I  am using  mvapich2-1.0.3  with  dapl  interconnect (its a proprietary  nic & dapl library)
> I got following error while running pallas over (amd dual core) 5 nodes cluster.
> 
> Fatal error in MPI_Recv:
> Message truncated, error stack:
> MPI_Recv(186)..........................: MPI_Recv(buf=0x7fff24744cec, count=952788905, MPI_INT, src=2, tag=1000,MPI_COMM_WORLD, status=0x7fff24744cd0) failed
> MPIDI_CH3U_Post_data_receive_found(243): Message from rank 2 and tag 1000 truncated; 4 bytes received but buffersize is -483811676
> rank 0 in job 2  test01_40634  caused collective abort of all ranks
>   exit status of rank 0: killed by signal 9
> 
> 
> will you suggest where we should look for solving above error ?
> what can we interpret from above message ?
> 
> wating for reply
> thanking
> Nilesh
> 
> 
>       Bring your gang together. Do your thing. Find your favourite Yahoo! group at http://in.promos.yahoo.com/groups/
> 
> 
> ------------------------------------------------------------------------
> Add more friends to your messenger and enjoy! Invite them now. <http://in.rd.yahoo.com/tagline_messenger_6/*http://messenger.yahoo.com/invite/> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
>  


      Did you know? You can CHAT without downloading messenger. Go to http://in.webmessenger.yahoo.com/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20081121/478ec019/attachment-0001.html


More information about the mvapich-discuss mailing list