cluster:15

cluster:15 [2006/12/20 09:19] (current)
Line 1: Line 1:
 +\\
 +**[[cluster:0|Home]]**
 +
 +
 +====== Scali/Manage  ======
 +
 +  * [[http://www.scali.com]]
 +
 +\\
 +Like Platform/ROCKS (see [[cluster:11|link]]), Scali/Manage is a software suite for managing clusters.  It appears very versatile.  There is a lot you can do with it, but what caught my interest in a brief perusal was:
 +\\
 +
 +  * heterogeneous clusters (as in, manage the other clusters on campus ...)
 +
 +  * "golden" image capture and deployment (you can also "roll-back" to previous versions!)
 +
 +  * simultaneous deployment of RPM installations (so you can update entire disk images with the "images", or update incrementally with RPM packages)
 +
 +
 +  * parallel ssh & file copy support
 +
 +
 +  * Change Management ... this is a biggie. For example, if you were to add a node, all nodes would need updating. With change management this becomes automatic: it auto-detects what needs updating on the other nodes
 +
 +
 +  * Fault Handling and Root Cause Analysis ... also a biggie: get warned that something is about to break before it actually does
 +
 +
 +
 +  * Scali/Manage also handles other servers, server farms, grids and blade racks (so, for example, rintintin's image could have been captured and deployed elsewhere, or rolled back if an upgrade was unsuccessful)
 +
 +
 +  * Java/Eclipse-based GUI and web-based client
 +
 +
 +  * it also supports PBS Pro, MPI libraries and MPI/HA ... HA meaning high availability. The reasoning goes like this: if jobs run for 30 days, a single node fails and MPI is not HA, then the entire job is aborted. HA provides a pathway for attempting to finish the job while the hardware underneath gets replaced.
 +
 +
 +  * And lots more.
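The change management item above can be illustrated with a small sketch. This is not Scali/Manage's API or algorithm; the function `pending_updates`, the node names and the idea of modelling each node's configuration as a set of known host names are all assumptions made for illustration. It only shows why adding one node makes every other node stale, and how that can be detected automatically.

```python
from typing import Dict, Set


def pending_updates(desired: Set[str], actual: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """Return, for each node, the desired entries that the node is still missing.

    Nodes that are already up to date are left out of the result.
    """
    pending = {}
    for node, have in actual.items():
        missing = desired - have
        if missing:
            pending[node] = missing
    return pending


# Hypothetical cluster: every node knows about every other node.
cluster_hosts = {"node01", "node02", "node03"}
known_hosts = {node: set(cluster_hosts) for node in cluster_hosts}

# Adding a node changes the desired state for the whole cluster.
cluster_hosts.add("node04")
known_hosts["node04"] = set()  # the new node knows nothing yet

pending = pending_updates(cluster_hosts, known_hosts)
for node in sorted(pending):
    print(node, "needs", sorted(pending[node]))
```

In this sketch the three existing nodes each need only the new entry, while the new node needs everything. A real tool would presumably track packages and configuration files rather than a single set of names.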
 +
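The MPI/HA reasoning above can be made concrete with some back-of-the-envelope arithmetic. The sketch below assumes node failures are independent and occur at a constant daily rate; the node count and failure rate are made-up numbers, not measurements from any cluster. Only the 30-day job length comes from the text.

```python
def job_survival_probability(nodes: int, days: float, daily_node_failure_rate: float) -> float:
    """Probability that no node fails during the job.

    Assumes independent node failures with a constant per-node daily rate,
    and that a single failure aborts the whole job (MPI without HA).
    """
    return (1.0 - daily_node_failure_rate) ** (nodes * days)


# Hypothetical numbers: 32 nodes, a 30-day job, 0.1% chance per node per day.
p_finish = job_survival_probability(nodes=32, days=30, daily_node_failure_rate=0.001)
print(f"chance the job finishes without HA: {p_finish:.2f}")  # roughly 0.38
```

Under these assumed numbers a long job is more likely to be aborted than to finish, which is the case for letting the job continue while failed hardware is replaced.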
 + \\
 +**[[cluster:0|Home]]**
  
cluster/15.txt · Last modified: 2006/12/20 09:19 (external edit)