[mvapich-discuss] Problem with more MPI jobs on the same node

Emir Imamagic eimamagi at srce.hr
Sat Aug 29 17:25:04 EDT 2009


Krishna Chaitanya Kandalla wrote:
> 2. Optionally, you can then hit the "o" key, hold the shift and the j 
> keys so that the "J" and the "A" fields are juxtaposed - this will be 
> easier to compare visually.

top - 23:24:01 up 56 days, 17:56,  3 users,  load average: 15.17, 7.18, 2.78
Tasks: 493 total,  17 running, 476 sleeping,   0 stopped,   0 zombie
Cpu(s): 25.0%us,  0.1%sy,  0.0%ni, 74.9%id,  0.0%wa,  0.0%hi,  0.0%si, 
0.0%st
Mem:  66072240k total, 10470960k used, 55601280k free,   336688k buffers
Swap:  7999992k total,        0k used,  7999992k free,  7723584k cached

  P   PID USER      PR  NI  VIRT  SHR  RES %CPU %MEM S    TIME+  COMMAND
  0  1267 eimamagi  25   0  164m  11m 117m 50.2  0.2 R   1:28.06 
lu.C.8.mvapich
  2  1269 eimamagi  25   0  164m 5092 111m 50.2  0.2 R   1:28.38 
lu.C.8.mvapich
  5  1272 eimamagi  25   0  164m 4988 111m 50.2  0.2 R   1:28.05 
lu.C.8.mvapich
  6  1273 eimamagi  25   0  164m 4220 110m 50.2  0.2 R   1:28.06 
lu.C.8.mvapich
  1  1338 eimamagi  25   0  164m 5040 111m 50.2  0.2 R   1:28.45 
lu.C.8.mvapich
  3  1382 eimamagi  25   0  164m 3352 109m 50.2  0.2 R   1:28.09 
lu.C.8.mvapich
  4  1383 eimamagi  25   0  164m 3184 109m 50.2  0.2 R   1:28.15 
lu.C.8.mvapich
  7  1386 eimamagi  25   0  164m 3356 109m 50.2  0.2 R   1:28.15 
lu.C.8.mvapich
  1  1268 eimamagi  25   0  164m 4976 111m 49.8  0.2 R   1:27.74 
lu.C.8.mvapich
  3  1270 eimamagi  25   0  164m 3180 109m 49.8  0.2 R   1:28.10 
lu.C.8.mvapich
  4  1271 eimamagi  25   0  164m 3124 109m 49.8  0.2 R   1:28.04 
lu.C.8.mvapich
  7  1274 eimamagi  25   0  164m 3744 109m 49.8  0.2 R   1:28.04 
lu.C.8.mvapich
  0  1337 eimamagi  25   0  164m  11m 117m 49.8  0.2 R   1:28.11 
lu.C.8.mvapich
  2  1381 eimamagi  25   0  164m 4956 111m 49.8  0.2 R   1:27.75 
lu.C.8.mvapich
  5  1384 eimamagi  25   0  164m 4992 111m 49.8  0.2 R   1:28.13 
lu.C.8.mvapich
  6  1385 eimamagi  25   0  164m 4232 110m 49.8  0.2 R   1:28.14 
lu.C.8.mvapich

ID's of cores which are used are consistent with output of mpstat below.

Here's also output of mpstat -P ALL 2 5 which nicely shows what's going on:

23:21:45     CPU   %user   %nice    %sys %iowait    %irq   %soft  %steal 
   %idle    intr/s
23:21:47     all   25.01    0.00    0.05    0.00    0.00    0.00    0.00 
   74.94   1004.00
23:21:47       0  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00   1004.00
23:21:47       1  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       2  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       3  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       4  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       5  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       6  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       7  100.00    0.00    0.00    0.00    0.00    0.00    0.00 
    0.00      0.00
23:21:47       8    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47       9    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      10    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      11    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      12    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      13    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      14    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      15    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      16    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      17    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      18    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      19    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      20    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      21    0.00    0.00    0.50    0.00    0.00    0.00    0.00 
   99.50      0.00
23:21:47      22    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      23    0.50    0.00    1.00    0.00    0.00    0.00    0.00 
   98.51      0.00
23:21:47      24    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      25    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      26    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      27    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      28    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      29    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      30    0.00    0.00    0.00    0.00    0.00    0.00    0.00 
  100.00      0.00
23:21:47      31    0.00    0.00    0.50    0.00    0.00    0.00    0.00 
   99.50      0.00



Thanks,

emir
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/x-pkcs7-signature
Size: 3283 bytes
Desc: S/MIME Cryptographic Signature
Url : http://mail.cse.ohio-state.edu/pipermail/mvapich-discuss/attachments/20090829/0a0dc530/smime.bin


More information about the mvapich-discuss mailing list