[mvapich-discuss] send desc error?

Weikuan Yu weikuan.yu at gmail.com
Fri Sep 26 12:03:48 EDT 2008


Hi, Xie,

 > BTW, mvapich-1.0.3 use mpich2-1.0.5 as the base,
 > mvapich2-1.2rc2 use mpich2-1.0.7,

Not sure how you got this impression. As far as I know, mvapich-1.0.3 is 
based on MPICH version 1.

 > what I want to know is what is the difference of ROMIO
 > in these two version?

MVAPICH1 has its romio based from the original from MPICH1, MVAPICH2 
from MPIPCH2. In addition, ROMIO in MVAPICH1 has added support for 
Lustre ADIO driver.

--Weikuan


Xie Min wrote:
> We are using mvapich2 on an infiniBand cluster, each node has two
> Quad-Core Intel Xeon 64 CPU.
> 
> After install mvapich2-1.0.3, we use NPB 3.3 to do some tests, but at
> least the bt.C.64 cannot run, it will exit with error after
> PMI_Barrier().
> Many tasks print the similar error messages:
> 
> send desc error
> [23] Abort: [] Got completion with error 5, vendor vcode=f9, dest rank
> =  40 (or error 9, vendor code=8a, etc)
> at line 512 in file ibv_channel_manager.c
> 
> We tried mvapich2-1.2rc2, bt.C.64 can run to completion without error.
> Because it seems mvapich2-1.0.3 is a stable version, so I am not sure
> if our runtime environment has some problems.
> 
> We use OpenFabrics 1.3 in the cluster nodes.
> 
> BTW, mvapich-1.0.3 use mpich2-1.0.5 as the base, mvapich2-1.2rc2 use
> mpich2-1.0.7, what I want to know is what is the difference of ROMIO
> in these two version?
> 
> Thanks.
> _______________________________________________
> mvapich-discuss mailing list
> mvapich-discuss at cse.ohio-state.edu
> http://mail.cse.ohio-state.edu/mailman/listinfo/mvapich-discuss
> 

-- 
Weikuan Yu <+> 1-865-574-7990
http://ft.ornl.gov/~wyu/


More information about the mvapich-discuss mailing list