This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:89 [2010/08/18 21:39] hmeij |
cluster:89 [2010/08/31 19:57] hmeij |
||
---|---|---|---|
Line 34: | Line 34: | ||
Netmask is, finally, 255.255.0.0 (excluding public 129.133 subnet). | Netmask is, finally, 255.255.0.0 (excluding public 129.133 subnet). | ||
+ | |||
+ | ===== Infiniband ===== | ||
+ | |||
+ | [[http:// | ||
+ | |||
+ | * Voltaire 4036 | ||
+ | * 519571-B21 | ||
+ | * Voltaire InfiniBand 4X QDR 36-Port Managed Switch | ||
+ | |||
+ | |||
===== DM380G7 ===== | ===== DM380G7 ===== | ||
- | [[http:// | + | [[http:// |
+ | [[http:// | ||
* Dual power (one to UPS, one to utility, do later) | * Dual power (one to UPS, one to utility, do later) | ||
Line 91: | Line 102: | ||
* /snapshot mount point for snapshot volume ~ 10tb | * /snapshot mount point for snapshot volume ~ 10tb | ||
* /sanscratch mount point for sanscratch volume ~ 5 tb | * /sanscratch mount point for sanscratch volume ~ 5 tb | ||
+ | * (next ones must be 50% empty for cloning to work) | ||
* logical volume LOCALSCRATCH: | * logical volume LOCALSCRATCH: | ||
* logical volumes ROOT/ | * logical volumes ROOT/ | ||
Line 115: | Line 127: | ||
* [[http:// | * [[http:// | ||
* HP iLo probably removes the need for IPMI, consult [[http:// | * HP iLo probably removes the need for IPMI, consult [[http:// | ||
- | | + | |
+ | * hmm, we can power up/off via CMU so perhaps IPMI is not needed nor this ability via SIM and web browser | ||
* is head node the Management server? possibly, needs access to provision and public networks | * is head node the Management server? possibly, needs access to provision and public networks | ||
* we may need a iLo eth? in range ... 192.198.104.x? | * we may need a iLo eth? in range ... 192.198.104.x? | ||
Line 124: | Line 137: | ||
* install monitoring client when building golden image node via CMU GUI | * install monitoring client when building golden image node via CMU GUI | ||
* clone nodes, deploy management agent on nodes | * clone nodes, deploy management agent on nodes | ||
+ | * PXEboot and wake-on-lan must be done manually in BIOS | ||
+ | * pre_reconf.sh (/ | ||
* not sure we can implement CMU HA | * not sure we can implement CMU HA | ||
+ | * collectl/ | ||
* Sun Grid Engine (SGE) | * Sun Grid Engine (SGE) | ||
Line 149: | Line 165: | ||
* Lava. Install from source and evaluate. | * Lava. Install from source and evaluate. | ||
+ | |||
+ | * Location | ||
+ | * remove 2 BSS racks (to pace.edu?), rack #3 & 4 | ||
+ | * add an L6-30 if needed (have 3? check) | ||
+ | * fill remaining 2 BSS racks with 24gb good servers, turn off | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: |