This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
cluster:221 [2023/03/01 16:36] hmeij07 |
cluster:221 [2023/03/14 13:59] (current) hmeij07 |
||
---|---|---|---|
Line 4: | Line 4: | ||
==== Infiniband Monitoring ==== | ==== Infiniband Monitoring ==== | ||
- | The NVIDIA Firmware Tools (MFT) is a toolset to generate a standard or customized NDIVIA firmware image Querying for firmware information. It is required for '' | + | The NVIDIA Firmware Tools (MFT) is a toolset to generate a standard or customized NDIVIA firmware image Querying for firmware information. It is required for '' |
* infiniband-diags | * infiniband-diags | ||
Line 118: | Line 118: | ||
</ | </ | ||
- | Ok, so onwards | + | Ok, so onward |
Download the script and stage in ''/ | Download the script and stage in ''/ | ||
Line 198: | Line 198: | ||
</ | </ | ||
+ | Under load with full power... | ||
+ | < | ||
+ | |||
+ | ibswinfo -d / | ||
+ | ================================================= | ||
+ | SwitchIB Mellanox Technologies | ||
+ | ================================================= | ||
+ | part number | ||
+ | serial number | ||
+ | product name | Scorpion2 IB EDR Unmanaged | ||
+ | revision | ||
+ | ports | 36 | ||
+ | PSID | MT_2640110032 | ||
+ | GUID | 0x900a840300ecde60 | ||
+ | firmware version | ||
+ | ------------------------------------------------- | ||
+ | uptime (d-h: | ||
+ | ------------------------------------------------- | ||
+ | PSU0 status | ||
+ | | ||
+ | | ||
+ | DC power | OK | ||
+ | fan status | ||
+ | power (W) | 27 <--- 27+32=59 units rated typical 122, max 162 | ||
+ | PSU1 status | ||
+ | | ||
+ | | ||
+ | DC power | OK | ||
+ | fan status | ||
+ | power (W) | 32 | ||
+ | ------------------------------------------------- | ||
+ | temperature (C) | 39 <--- one degree higher | ||
+ | max temp (C) | 45 | ||
+ | ------------------------------------------------- | ||
+ | fan status | ||
+ | fan#1 (rpm) | 8337 | ||
+ | fan#2 (rpm) | 7194 | ||
+ | fan#3 (rpm) | 8287 | ||
+ | fan#4 (rpm) | 7045 | ||
+ | fan#5 (rpm) | 8389 | ||
+ | fan#6 (rpm) | 7194 | ||
+ | fan#7 (rpm) | 8441 | ||
+ | fan#8 (rpm) | 7156 | ||
+ | ------------------------------------------------- | ||
+ | |||
+ | </ | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: | ||