This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
cluster:200 [2021/01/14 20:20] hmeij07 |
cluster:200 [2021/02/18 18:33] (current) hmeij07 |
||
---|---|---|---|
Line 1: | Line 1: | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: | ||
+ | |||
+ | Update | ||
+ | --- // | ||
+ | |||
+ | |||
+ | ---- | ||
+ | |||
+ | For CUDA_ARCH (or '' | ||
+ | |||
+ | ---- | ||
+ | |||
+ | A detailed review and comparison of GEForce gpus, including the Quadro RTX 5000 and RTX 2080 (Ti and S) can be found at this[[https:// | ||
+ | |||
+ | * Noteworthy re RTX2080S | ||
+ | * VendorB "RTX 2080 Super are EOL" | ||
+ | * VendorA "neigh impossible to obtain any of them" | ||
+ | * Noteworthy re RTX3060Ti | ||
+ | * VendorB "The 3060 Ti does not have the proper cooling for data center use and are not built for that environment" | ||
+ | * VendorA "Lead times on the new GPUs are generally 2 months or more." | ||
+ | |||
+ | ^ VendorB1 | ||
+ | | ||| ||| ||| | ||
+ | ^ Head Node | ||
+ | | Rack | 1U | | 1U ||| same ||| | ||
+ | | Power | 1+1 | 208V | 1+1 ||| same ||| | ||
+ | | Nic | 2x1G+4x10G | ||
+ | | Rails | 25 | | 25-33 ||| same ||| | ||
+ | | CPU | 2x6226R | ||
+ | | cores | 2x16 | Physical | ||
+ | | ghz | 2.9 | | 3.8 ||| same ||| | ||
+ | | ddr4 | 192 | gb | 96 ||| same ||| | ||
+ | | hdd | 2x480G | ||
+ | | centos | 8 | yes | 8 ||| same ||| | ||
+ | | OpenHPC | yes | "best effort" | ||
+ | ^ GPU Compute Node ^^ ^ GPU Compute Node | ||
+ | | Rack | 2U | | 4U ||| same ||| | ||
+ | | Power | 1 | 208V | 1+1 ||| same ||| | ||
+ | | Nic | 2x1G+2x10G | ||
+ | | Rails | ? | | 26-36 ||| same ||| | ||
+ | | CPU | 2x4214R | ||
+ | | cores | 2x12 | Physical | ||
+ | | ghz | 2.4 | | 2.4 ||| same ||| | ||
+ | | ddr4 | 192 | gb | 192 ||| same ||| | ||
+ | | hdd | 480G | < | ||
+ | | centos | 8 | with gpu drivers, toolkit | ||
+ | | GPU | 4x(RTX 5000) | active cooling | ||
+ | | gddr6 | 16 | gb | 16 ||| 24 ||| | ||
+ | ^ | ||
+ | | Switch | 1x(8+1) | ||
+ | | S&H | tbd | | tbd ||| tbd ||| | ||
+ | | Δ | -5 | | ||
+ | |||
+ | |||
+ | * RTX 5000 gpu teraflop compute capacity depends on compute mode | ||
+ | * 0.35 TFLOPS (FP64), 11.2 TFLOPS (FP32), 22.3 TFLOPS (FP16), 178.4 TFLOPS (INT8) | ||
+ | * RTX 6000 gpu teraflop compute capacity depends on compute mode | ||
+ | * 0.51 TFLOPS (FP64), 16.3 TFLOPS (FP32), 32.6 TFLOPS (FP16), 261.2 TFLOPS (INT8) | ||
+ | |||
+ | From NVIDIA' | ||
+ | |||
+ | < | ||
+ | |||
+ | Quadro RTX 5000 vs RTX 2080 | ||
+ | |||
+ | both have effective 14000Mhz GDDR6 | ||
+ | both have 64 ROPS. | ||
+ | |||
+ | 5000 has 16GB vs 2080's 8GB | ||
+ | 5000 has 192 TMU's vs the 2080's 184 | ||
+ | 5000 has 3072 shaders vs the 2080's 2944 | ||
+ | |||
+ | the 5000 has a base clock of 1350 and average boost to 1730 | ||
+ | the 2080 has a base clock of 1515 and average boost to 1710 | ||
+ | the 5000 has 384 tensor cores vs the 2080's 368. | ||
+ | the 5000 has 48 RT cores vs the 2080's 46. | ||
+ | |||
+ | 5000 | ||
+ | Pixel Rate 110.7 GPixel/ | ||
+ | Texture Rate 332.2 GTexel/ | ||
+ | FP16 (half) performance | ||
+ | FP32 (float) performance | ||
+ | FP64 (double) performance | ||
+ | |||
+ | 2080 | ||
+ | Pixel Rate 109.4 GPixel/ | ||
+ | Texture Rate 314.6 GTexel/ | ||
+ | FP16 (half) performance | ||
+ | FP32 (float) performance | ||
+ | FP64 (double) performance | ||
+ | |||
+ | </ | ||
+ | |||
+ | |||
+ | |||
==== Cottontail2 ==== | ==== Cottontail2 ==== |