User Tools

Site Tools


cluster:92

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
Next revision Both sides next revision
cluster:92 [2010/12/09 15:56]
hmeij created
cluster:92 [2010/12/20 20:36]
hmeij
Line 6: Line 6:
 ===== HP Towards Deployment ===== ===== HP Towards Deployment =====
  
-  * +  * job 100,000 mile stone (and 1,000,000 on swallowtail)
  
-  * n21: missing dimms or bad dimmsopen up the blade enclosure.+  * setup scripts to log usagegather stats
  
-  * Clean up /root and archive the training session materials.+  * migrate commercial software to greentail?  matlab, stata
  
-  * Document training session materials.  Done, on itsdoku wiki.+  * buy new intel compilers and cmkl?
  
-  * Full backup of local hard disk (/ and /boot).  Done.+  * run some test jobs via scheduler, some benchmarks
  
-  * Linpack burn in.  Stressing the hardware. Done 12/08/2010.  Results can be found here: [[cluster:91|Linpack]]+  * install Lava (no SGE installed, so lets do this first) 
 + 
 +  * rsync over rest of home directories over break (started 20dec10) 
 +    * without the '--delete' and '--delete-exclude' flags 
 +    * added '--stats' flag 
 + 
 +  * 12/18/2010 set up rsnapshot 
 +    * exercising hourly, daily, weekly, monthly 
 +    * monitor how long this takes 
 + 
 +  * 12/15/2010 copy /share/apps across, see below 
 +    * will have to rsync petaltail:/share/apps/src into /share/backups/petaltail (to pull into greentail:/share/apps/src) 
 + 
 +  * 12/15/20101 connect filer3's home directories on /mnt 
 +    * rsync -vac -k /oldhome/username /home (does the trick, note no trailing slashes) 
 +    * i will rsync everything over on date X and redo on date X+Y, overwrite 
 +    * anticipate a deadline for moving off netapp filer3 permanently (dell cluster reconfig dependent) 
 +    * make sure understand the lack of TSM filesystem backups 
 + 
 +  * CMU license expired, turns out to be a demo license. 
 +    * chasing around hp support for a resolution. 
 + 
 +  * 12/15/2010 n21: missing dimms or bad dimms, open up the blade enclosure. Fixed. 
 + 
 +  * 12/09/10 CMU backup 
 +    * single file daily backup cmu.conf to /usr/local/backups/cmu 
 + 
 +  * 12/09/10 Backup databases (hpsmdb for SIM using postgresql, mysql not installed) 
 +    * not much success, not really needed, can do a rediscover 
 + 
 +  * 12/09/10 Clean up /root and archive the training session materials. 
 + 
 +  * 12/08/10 Linpack burn in.  Stressing the hardware. Done.  Results can be found here: [[cluster:91|Linpack]] 
 + 
 +  * 12/05/10 Document training session materials.  Done, on itsdoku wiki. 
 + 
 +  * 12/03/10 Full backup of local hard disk (/ and /boot).  Done. 
 + 
 +  * 12/03/10 David Holton arrives for training.  Part of "resource for a week" Replaced a disk that was already flagged as "predictive failure" Biggest change in cluster configuration is how the MSA60 volumes were set up.  We destroyed all that and rearranged three volumes across the scsi cables (that is vertical versus horizontal) and applied LVM on top. 
 + 
 +  * 11/25/10 Decommissioned two BSS racks (cluster sharptail) which will be donated Wellesley University.  That freed up six L6-30 circuits.  With James,  three of them were pulled to the area were the Flexible Storage array and Equallogic array are racked.  The other three are dedicated to the HP cluster.  Pulled one Enterprise UPS L6-30 near the HP cluster and turned it on. Three PDUs are lit up, but differ from documentation.  Change the cabling a bit so that both head node power supplies, one side of the MSA60 power supplies, and all switches plus KVM are on the enterprise UPS.  This shall not be interruptable, yea. 
 + 
 +  * 11/15/10 Cluster arrives.  A bit overdue (first ETA was 10/01/10).
  
 \\ \\
 **[[cluster:0|Home]]** **[[cluster:0|Home]]**
cluster/92.txt · Last modified: 2011/03/30 15:59 by hmeij