Warning: Undefined array key "DOKU_PREFS" in /usr/share/dokuwiki/inc/common.php on line 2082
cluster:109 [DokuWiki]

User Tools

Site Tools


cluster:109

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Last revision Both sides next revision
cluster:109 [2013/02/02 14:03]
hmeij [Lammps GPU Testing (EC)]
cluster:109 [2013/02/02 15:30]
hmeij [Lammps GPU Testing (EC)]
Line 10: Line 10:
  
  
-Same tests (12 cpu cores) using lj/cut, eam, lj/expand, and morse: AU.reduced+Same tests (12 cpu cores) using lj/cut, eam, lj/expand, and morse: **AU.reduced**
  
     CPU only 6 mins 1 secs     CPU only 6 mins 1 secs
Line 16: Line 16:
     2 GPUs 1 mins 0 secs (never saw 2nd GPU used, problem set too small?)     2 GPUs 1 mins 0 secs (never saw 2nd GPU used, problem set too small?)
  
-Same tests (12 cpu cores) using a restart file and using gayberne: GB+Same tests (12 cpu cores) using a restart file and using gayberne: **GB**
  
     CPU only 1 hour 5 mins     CPU only 1 hour 5 mins
     1 GPU 5 mins and 15 secs (a 18-19 times peed up)     1 GPU 5 mins and 15 secs (a 18-19 times peed up)
-    2 GPUs 2 mins (see below)+    2 GPUs 2 mins 
 +     
 +Above results seems overall a bit slower that at other vendor, but same pattern. 
 + 
 +Francis's Melt problem set 
 + 
 +^3d Lennard-Jones melt: for 10,000 steps with 32,000 atoms^^^^^^ 
 +|CPU only|  -np 1  |  -np 6 | -np 12  |  -np 24  |  -np 36  | 
 +|loop times|  329s  |  63s  |  39s  |    29s  |  45s  | 
 +|GPU only|  1xK20  |  2xK20 |  3xK20  |  4xK20  |  (-np 1-4 | 
 +|loop times|  28s  |  16s |  11s  |  10s  |    | 
 +^3d Lennard-Jones melt: for 100,000 steps with 32,000 atoms^^^^^^ 
 +|GPU only|  1xK20  |  2xK20 |  3xK20  |  4xK20  |  (-np 1-4)  | 
 +|loop times|  274s  |  162s |  120s  |  98s  |    | 
 + 
 +  * Serial's time of 329s is reduced to 29s for MPI, an 11x speed up 
 +  * GPU's serial time matches MPI -np 24 and can be further reduced to 10s, a 3x speed up 
  
 ===== Lammps GPU Testing (MW) ===== ===== Lammps GPU Testing (MW) =====
cluster/109.txt · Last modified: 2013/10/16 15:13 by hmeij