This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:181 [2019/08/07 14:30] hmeij07 [2019 GPU Models] |
cluster:181 [2019/08/12 14:16] hmeij07 |
||
---|---|---|---|
Line 21: | Line 21: | ||
A lot of information comes from this web site [[https:// | A lot of information comes from this web site [[https:// | ||
- | Bench statistics (Nidia GTX 1070 is about 100% baseline) from this web site [[https:// | + | Bench statistics (Nvidia |
Most GPU models come in multiple memory configurations, | Most GPU models come in multiple memory configurations, | ||
Line 27: | Line 27: | ||
This is a handy tool [[https:// | This is a handy tool [[https:// | ||
- | Learn more about the T4 ... the T4 can run in mixed mode (fp32/fp16) and can deliver 65 Tflops. Other modes are INT8 at 130 Tops and INT4 260 Tops. Now at 65 Tflops mixed precision the cost dives to $34/tflop. Amazing. And the wattage is amazing too. | + | Learn more about the T4 ... the T4 can run in mixed mode (fp32/fp16) and can deliver 65 Tflops. Other modes are INT8 at 130 Tops and INT4 260 Tops. Now at 65 Tflops mixed precision the cost dives to $34/tflop. Amazing. And the wattage is amazing too. See the next page for the fp64/fp32 mixed precision mode quandary...[[cluster: |
* [[https:// | * [[https:// | ||
* [[https:// | * [[https:// | ||
- | * [[http:// | + | * [[http:// |
* very interesting peak performance FP32 gpu chart (RTX TITAN and RTX 6000 on top) | * very interesting peak performance FP32 gpu chart (RTX TITAN and RTX 6000 on top) | ||
* [[https:// | * [[https:// | ||
- | From Lammps developer: " | + | From Lammps developer: " |
- | Using half precision in any form for force computations is not advisable." | + | |
+ | From Gromacs web site: " | ||
**Keep track of these** | **Keep track of these** |