This shows you the differences between two versions of the page.
cluster:167 [2018/06/26 16:48] hmeij07 [CPU vs GPU] |
cluster:167 [2018/06/27 17:18] hmeij07 [CPU vs GPU] |
||
---|---|---|---|
Line 4: | Line 4: | ||
==== CPU vs GPU ==== | ==== CPU vs GPU ==== | ||
- | So the question was raised what does our usage look like between CPU and GPU? I have no idea what the appropriate metrics would be but let's start with comparing the hardware deployed. | + | So the question was raised what does our usage look like between CPU and GPU devices? I have no idea what the appropriate metrics would be but let's start with comparing the hardware deployed. |
+ | |||
+ | * Data covers the period June 1 to June 25, 2018 (job information data ages out) | ||
+ | * Maybe build a monthly script if this turns out to be usable info | ||
+ | * That period covers 600 hours | ||
+ | * Assume 99% utilization of cpu core or gpu device | ||
+ | * Available time is measured per cpu core, but per gpu device | ||
+ | * There is no good/bad metric | ||
+ | * Never collated such data before | ||
+ | * The GPU usage is based on detecting gpu reservations (gpu= flag) | ||
^ Metric ^ CPU ^ Ratio ^ GPU ^ Notes ^ | ^ Metric ^ CPU ^ Ratio ^ GPU ^ Notes ^ | ||
- | | Device Count | 72 | 3:1 | 24 | cpu all intel, gpu all nvidia | | + | | Device Count | 72 | 3:1 | 24 | cpu all intel, gpu all nvidia | |
- | | Core Count | 1,712 | 1:37.6 | 64,300 | physical | + | | Core Count | 1,192 | 1:54 |
- | | Avail Hours | 7,272,576 | 71.3:1 | 101,952 | total cpu cores, total gpus | | + | | Memory | 7,408 | 51:1 | 144 | GB | |
+ | | Teraflops | 38 | 1.5:1 | 25 | double precision, floating point, theoretical | ||
+ | | Avail Hours | 715,200 | 50:1 | 14,400 | total cpu cores, total gpus | | ||
+ | | Job Count | 2,834 | | ||
+ | | Job Hours | 221, | ||
+ | | Avail Hrs Util% | 31 | 6:1 | 5 | weeping... | | ||
+ | | Avail Hours2 | 561, | ||
+ | | Avail Hrs2 Util% | 39 | 8:1 | 5 | more realistic... | | ||
+ | |||
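The Avail Hours row follows from the bullets above: units (cpu cores or gpu devices) times the hours in the period. Below is a minimal sketch of that arithmetic, using the core count, gpu count and dates from this page. The function names are made up for illustration and this is not the script used to build the table. The 99% assumption is not modeled, and job hours are left as a parameter rather than filled in.

```python
from datetime import date

HOURS_PER_DAY = 24


def period_hours(start: date, end: date) -> int:
    """Hours in an inclusive date range (June 1 to June 25 is 25 days)."""
    return ((end - start).days + 1) * HOURS_PER_DAY


def available_hours(units: int, hours: int) -> int:
    """Available time; one unit is a cpu core or a whole gpu device."""
    return units * hours


def utilization_pct(job_hours: float, avail_hours: float) -> float:
    """Share of available hours consumed by jobs, as a percentage."""
    return 100.0 * job_hours / avail_hours


hours = period_hours(date(2018, 6, 1), date(2018, 6, 25))
cpu_avail = available_hours(1192, hours)  # cpu cores, from the table
gpu_avail = available_hours(24, hours)    # gpu devices, from the table

# prints: 600 715200 14400 50
print(hours, cpu_avail, gpu_avail, round(cpu_avail / gpu_avail))
```

Feeding a job-hours total into `utilization_pct` together with `cpu_avail` or `gpu_avail` would give the Util% rows. Counting a gpu as one unit while counting each cpu core separately is what drives the 50:1 ratio.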
+ | The logs showing gpu %util confirm the extremely low GPU usage. When concatenating the four gpu %util values into a string, since 01Jan2017, the string ' | ||
**[[cluster: | **[[cluster: |