[mvapich-discuss] mvapich2-0.9.8 blacs problems (Another)
Bas van der Vlies
basv at sara.nl
Wed Mar 28 05:41:55 EDT 2007
amith rajith mamidala wrote:
> Hi Bas,
>
> Attached is the patch which resolves this error with respect to the
> number of communicators/groups that are created. The
> program runs fine with the attached patch. We are also investigating this in more depth and
> will get back to you if we see any more issues,
>
Amith,
We just tested the patch and run some other programms without any
errors or other side effects.
Thanks for the patches.
>
> On Mon, 26 Mar 2007, Bas van der Vlies wrote:
>
>> Hello,
>>
>> We still have problems with mvapich2 0.9.8 + patches and blacs. Here
>> is another file attached, build/run command:
>>
>> {{{
>> mpif90 -o pdgemr2dtest.$brand -ff2c -Wall -g pdgemr2dtest.f90
>> -lscalapack -lfblacs -lcblacs -lblacs -llapack -latlas
>>
>> echo 310 16 1000 | mpiexec -n $nprocs <program_name>
>> }}}
>>
>>
>> The problems always occurs if we do many loops. It will consume more and
>> more memory and then it crash with the following error:
>> {{{
>>
>> loop n mb nprocs npcol nprow 83 310 16 8 4 2
>> loop n mb nprocs npcol nprow 84 310 16 8 4 2
>> loop n mb nprocs npcol nprow 85 310 16 8 4 2
>> rank 7 in job 1 ib-r6n18.irc.sara.nl_7000 caused collective abort of
>> all ranks
>> exit status of rank 7: killed by signal 9
>> rank 6 in job 1 ib-r6n18.irc.sara.nl_7000 caused collective abort of
>> all ranks
>> exit status of rank 6: killed by signal 9
>> rank 4 in job 1 ib-r6n18.irc.sara.nl_7000 caused collective abort of
>> all ranks
>> exit status of rank 4: killed by signal 9
>> }}}
>>
>>
>>
>> --
>> ********************************************************************
>> * *
>> * Bas van der Vlies e-mail: basv at sara.nl *
>> * SARA - Academic Computing Services phone: +31 20 592 8012 *
>> * Kruislaan 415 fax: +31 20 6683167 *
>> * 1098 SJ Amsterdam *
>> * *
>> ********************************************************************
>>
>
>
> ------------------------------------------------------------------------
>
> Index: create_2level_comm.c
> ===================================================================
> --- create_2level_comm.c (revision 1120)
> +++ create_2level_comm.c (working copy)
> @@ -163,6 +163,8 @@
> else{
> comm_ptr->shmem_coll_ok = 0;
> free_2level_comm(comm_ptr);
> + MPI_Group_free(&subgroup1);
> + MPI_Group_free(&comm_group);
> }
>
--
********************************************************************
* *
* Bas van der Vlies e-mail: basv at sara.nl *
* SARA - Academic Computing Services phone: +31 20 592 8012 *
* Kruislaan 415 fax: +31 20 6683167 *
* 1098 SJ Amsterdam *
* *
********************************************************************
More information about the mvapich-discuss
mailing list