[Columbus] Compilation problem of parallel Columbus (ga-5.0)

Felix Plasser felix.plasser at univie.ac.at
Wed Dec 9 12:06:26 EST 2015


Hi, can you run any parallel MR-CI job at all? Just something small on a 
few cores on one node?

-Felix

On 12/09/2015 05:55 PM, Max Pinheiro Jr wrote:
> Dear Columbus team,
>
> I am trying to run some parallel calculations with Columbus in a Cray 
> linux cluster. I have compiled the parallel version of the code using 
> the ga-5.0 libraries and the intel compiler 16.0 with success. 
> However, when I try to run the set of parallel tests of Columbus 
> during the pciudg.x routine printing out the following error message 
> in the runc.error file:
>
>           starting ci iteration   1
>
> 0:0:ga_ndim: ARRAY NOT ACTIVE:: 0
> (rank:0 hostname:eos01 pid:128811):ARMCI DASSERT fail. 
> src/armci.c:ARMCI_Error():276 cond:0
> Last System Error Message from Task 0:: Inappropriate ioctl for device
> application called MPI_Abort(MPI_COMM_WORLD, 0) - process 0
> Last System Error Message from Task 1:: Inappropriate ioctl for device
> application called MPI_Abort(MPI_COMM_WORLD, 2) - process 1
>
> According to the mailing lists of other codes (NWchem and Molpro), 
> this problem seems to be related with the compilation of the global 
> array libraries but I haven't found a clear solution for this error so 
> far. The configure instructions I have used to compile the global 
> array 5.0 is in line with that suggested by other codes in the case of 
> intel and impi but I am not sure if these flags work also for 
> Columbus. The configure parameters are:
>
> ./configure --with-tcgmsg --prefix=/u/maxjr/codes/ga-5.0 CC=icc 
> F77=ifort --enable-i8 --enable-autodetect --enable-armci-group 
> --with-mpi=/afs/@cell/common/soft/intel/ics2016/impi/5.1.1/intel64
>
> For the Columbus compilation, I am using the following config file:
>
> GREP /bin/grep
> GMAKE make
> TAR /bin/tar
> RANLIB /usr/bin/ranlib
> CPPL cpp -C -E -traditional
> PERL /usr/bin/perl
> BLASLIBRARY -Wl,--start-group 
> /afs/@cell/common/soft/intel/ics2016/mkl/lib/intel64_lin/libmkl_intel_ilp64.a 
> /afs/@cell/common/soft/intel/ics2016/mkl/lib/intel64_lin/libmkl_core.a 
> /afs/@cell/common/soft/intel/ics2016/mkl/lib/intel64_lin/libmkl_sequential.a 
> -Wl,--end-group -lpthread -lm
> LAPACKLIBRARY  -Wl,--start-group 
> /afs/@cell/common/soft/intel/ics2016/mkl/lib/intel64_lin/libmkl_intel_ilp64.a 
> /afs/@cell/common/soft/intel/ics2016/mkl/lib/intel64_lin/libmkl_core.a 
> /afs/@cell/common/soft/intel/ics2016/mkl/lib/intel64_lin/libmkl_sequential.a 
> -Wl,--end-group -lpthread -lm
> MACHINEID linux64.ifc.byterecl
> GUNZIP /bin/gunzip
> GACOMMUNICATION MPI
> MPI_MAINDIR /afs/@cell/common/soft/intel/ics2016/impi/5.1.1/intel64/lib
> MPI_CC mpiicc
> MPI_LD mpiifort -Wl,-z,muldefs
> MPI_FC mpiifort -Wl,-z,muldefs
> PSCRIPT
> GAVERSION GA50
> MPI_LIBS 
> /afs/@cell/common/soft/intel/ics2016/impi/5.1.1/intel64/lib/libmpi_ilp64.so.4.1 
> -libverbs -libumad
> MPI_STARTUP 
> /afs/@cell/common/soft/intel/ics2016/impi/5.1.1/intel64/bin/mpirun -v 
> -np _NPROC_  _EXE_ _EXEOPTS_
> COLUMBUS /u/maxjr/codes/columbus-7.0/Columbus
> DALTON /u/maxjr/codes/columbus-7.0/Columbus/source/dalton
> COLUMBUSVERSION 7.0
> INSTALLOPTION NOGA_INSTALL
> cpan
> standard
> grad
> parallel
>
> The computer system where I am try to run the calculations has 32 
> Intel Xeon E5-2698 cpus per node and ~120 GB of ram memory per node. 
> So this probably is not a problem of memory restrictions. Anyway, I 
> have tried to run a parallel calculation increasing the maximum ARMCI 
> shared memory by setting the flag ARMCI_DEFAULT_SHMMAX in the 
> submission script, as suggested in some forum discussions of other 
> codes with similar problems, but the calculations fail again in the 
> same step (pciudg.x).
>
> Please, anybody has a suggestion to fix this problem? I will greatly 
> appreciate to resolve this problem.
>
> Thanks & All the best,
>
> Max Pinheiro Jr
> Pos-doctoral researcher at Instituto Tecnológico de Aeronáutica - ITA
>
>
>
>
> _______________________________________________
> Columbus mailing list
> Columbus at lists.osc.edu
> https://lists.osu.edu/mailman/listinfo/columbus

-- 
Felix Plasser
Institute for Theoretical Chemistry
University of Vienna
Währingerstr. 17/404
1090 Wien
felix.plasser at univie.ac.at
http://homepage.univie.ac.at/felix.plasser/
Tel.: +43-1-4277-52757

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.osu.edu/pipermail/columbus/attachments/20151209/aa3f8ca9/attachment.html>


More information about the Columbus mailing list