This shows you the differences between two versions of the page.
cluster:89 [2010/08/17 17:55] hmeij |
cluster:89 [2010/08/18 19:21] hmeij |
||
---|---|---|---|
Line 18: | Line 18: | ||
Basically ... | Basically ... | ||
+ | |||
+ | * configure all console port switches with an IP | ||
+ | * depending on switch IP in 192.168.102.x or 10.10.102.x | ||
+ | * voltaire console can be stuffed in either | ||
+ | |||
+ | * head node will be connected to our private network via two link-aggregated ethernet cables in the 10.10.x.y range so current home directories can be mounted somewhere (these dirs will not be available on the back end nodes). | ||
* x.y.z.255 is broadcast | * x.y.z.255 is broadcast | ||
Line 36: | Line 42: | ||
* hostname [[http:// | * hostname [[http:// | ||
* eth0, provision, 192.168.102.254/ | * eth0, provision, 192.168.102.254/ | ||
+ | * do we need an iLo eth? in range 192.168.104.254? | ||
* eth1, data/ | * eth1, data/ | ||
- | * eth2, public, 129.133.1.226/ | + | * eth2, public, 129.133.1.226/ |
- | * eth3, ipmi, 192.168.103.254/ | + | * eth3 (over eth2), ipmi, 192.168.103.254/ |
+ | * see discussion iLo/IPMI under CMU | ||
* ib0, ipoib, 10.10.103.254/ | * ib0, ipoib, 10.10.103.254/ | ||
* ib1, ipoib, 10.10.104.254/ | * ib1, ipoib, 10.10.104.254/ | ||
Line 60: | Line 68: | ||
* Three volumes to start with: | * Three volumes to start with: | ||
- | * home (raid 6, design a backup path, do later), 10 tb | + | * home (raid 6), 10 tb |
- | * apps (raid 6, design a backup path, do later), 1tb | + | * snapshot |
- | * sanscratch (raid 1, no backup), 5 tb | + | * sanscratch (raid 1 or 0, no backup), 5 tb |
* SIM | * SIM | ||
Line 72: | Line 80: | ||
* node names hp000, increment by 1 | * node names hp000, increment by 1 | ||
* eth0, provision, 192.168.102.25(increment by 1)/ | * eth0, provision, 192.168.102.25(increment by 1)/ | ||
+ | * do we need an iLo eth? in range 192.168.104.25(increment by 1) | ||
+ | * CMU wants eth0 on NIC1 and PXEboot | ||
* eth1, data/ | * eth1, data/ | ||
- | * eth2, ipmi, 192.168.103.25(increment by 1)/ | + | * eth2 (over eth1), ipmi, 192.168.103.25(increment by 1)/ |
+ | * see discussion iLo/IPMI under CMU | ||
* ib0, ipoib, 10.10.103.25(increment by 1)/ | * ib0, ipoib, 10.10.103.25(increment by 1)/ | ||
* ib1, ipoib, 10.10.104.25(increment by 1)/ | * ib1, ipoib, 10.10.104.25(increment by 1)/ | ||
Line 100: | Line 111: | ||
* configure automatic event handling | * configure automatic event handling | ||
- | * Cluster Management Utility (CMU)[[http:// | + | * Cluster Management Utility (CMU)[[http:// |
- | * install, configure, monitor | + | * iLo/IPMI |
- | * golden image capture, deploy | + | * HP iLo probably removes the need for IPMI, consult [[http:// |
+ | * well maybe not, IPMI ([[http:// | ||
+ | * is head node the Management server? possibly, needs access to provision and public networks | ||
+ | * we may need an iLo eth? in range ... 192.168.104.x? | ||
+ | * CMU wants eth0 on NIC1 and PXEboot | ||
+ | * install CMU management node | ||
+ | * install X and CMU GUI client node | ||
+ | * start CMU, start client, scan for nodes, build golden image | ||
+ | * clone nodes, deploy management agent on nodes | ||
+ | * install monitoring | ||
* Sun Grid Engine (SGE) | * Sun Grid Engine (SGE) | ||
Line 116: | Line 136: | ||
* where in data center (do later), based on environmental works | * where in data center (do later), based on environmental works | ||
+ | ===== ToDo ===== | ||
+ | |||
+ | All of these can wait until after the HP cluster is up. | ||
+ | |||
+ | * Backups. | ||
+ | * Exclude very large files? | ||
+ | * petaltail:/ | ||
+ | * or better [[http:// | ||
+ | |||
+ | * Lava. Install from source and evaluate. | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: |
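
The compute node scheme in the diff above (node names hp000 incrementing by 1, with addresses like 192.168.102.25 "increment by 1" on each network) can be sketched as a small generator. This is only an illustration under stated assumptions: it reads "25(increment by 1)" as a last octet that starts at 25 and grows by one per node, it stops before .254 (used by the head node) and .255 (broadcast), and it leaves out eth1 because that address is cut off in the page. The function name `node_plan` and the `first_octet` parameter are hypothetical, not part of the original notes.

```python
import ipaddress

# Network bases taken from the node section of the page; eth1 (data) is
# omitted because its address is truncated in the source.
NETWORK_BASES = {
    "eth0": "192.168.102.0",  # provision
    "eth2": "192.168.103.0",  # ipmi
    "ib0": "10.10.103.0",     # ipoib
    "ib1": "10.10.104.0",     # ipoib
}


def node_plan(count, first_octet=25):
    """Return a list of (hostname, {interface: ip}) tuples.

    Assumption: node i gets last octet first_octet + i on every network.
    """
    plan = []
    for i in range(count):
        last = first_octet + i
        # .254 is the head node and .255 is broadcast, so stop at .253.
        if last > 253:
            raise ValueError(f"node {i} would need last octet {last}")
        addrs = {
            iface: str(ipaddress.IPv4Address(base) + last)
            for iface, base in NETWORK_BASES.items()
        }
        plan.append((f"hp{i:03d}", addrs))
    return plan


if __name__ == "__main__":
    for host, addrs in node_plan(3):
        print(host, addrs)
```

Keeping the last octet identical across all networks makes it easy to map a hostname to any of its interfaces, which may help when comparing the iLo/IPMI range against the provision range.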