cluster:196
This is an old revision of the document!
Netdata
We use Zenoss for monitor and alerting the whole HPC. Page can be found here Zenoss
At PEARC20 conference I became aware of Netdata which seems a good tool for our “tails” (login, storage servers for example). Lots of detailed information.
bash <(curl -Ss https://my-netdata.io/kickstart.sh)
Then open port 19999 in firewall for wesleyan.edu.
- hpcmon Zenoss server
- cottontail2 Backup scheduler, centos6 compile env (needs reboot)
- greentail52 /sanscratch nfs server (needs reboot)
- whitetail /lvhomes nfs server (old /home)
- sharptail2dr disaster recovery host for hpcstore
cluster/196.1596056724.txt.gz · Last modified: by hmeij07
