Warning: Undefined array key "DOKU_PREFS" in /usr/share/dokuwiki/inc/common.php on line 2082
cluster:121 [DokuWiki]

User Tools

Site Tools


cluster:121

Warning: Undefined array key -1 in /usr/share/dokuwiki/inc/html.php on line 1458

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
cluster:121 [2013/09/16 09:41]
hmeij [Hadoop Summary]
cluster:121 [2013/09/16 11:09]
hmeij
Line 4: Line 4:
 ==== Hadoop Summary ==== ==== Hadoop Summary ====
  
-Our production Hadoop Cluster is based on [[http://www.cloudera.com/content/cloudera/en/home.html|Cloudera]]'s CD3U6 repository.  Here are some details:+Our production Hadoop Cluster is based on [[http://www.cloudera.com/content/cloudera/en/home.html|Cloudera]]'s CD3U6 repository.  Here are some details :
  
   * namenode (that is login node): whitetail.wesleyan.edu   * namenode (that is login node): whitetail.wesleyan.edu
     * whitetail also runs the Hadoop Scheduler and Health Monitor     * whitetail also runs the Hadoop Scheduler and Health Monitor
-      * Health Status [[http://whitetail.wesleyan.edu:50070]] +      *  [[http://whitetail.wesleyan.edu:50070|Health Status]] 
-      * Job Tracker [[http://whitetail.wesleyan.edu:50030]]+      *  [[http://whitetail.wesleyan.edu:50030|Job Tracker]]
     * ssh to it directly or from any of our other tails     * ssh to it directly or from any of our other tails
   * resources: access to 600 GB of memory and 1.75 TB of Hadoop's Distributed File System (HDFS)   * resources: access to 600 GB of memory and 1.75 TB of Hadoop's Distributed File System (HDFS)
Line 15: Line 15:
  
   * HDFS is not backed up!   * HDFS is not backed up!
-  * You must request a  writable work area /userdata/username +    * You must request a  writable work area /userdata/username 
-  * Be sure to down load your results to /home/username+    * Be sure to down load your results to /home/username (that is the regular filesystem)
  
   * Data to be shared (dictionaries, anagrams, etc) can be posted in /shareddata   * Data to be shared (dictionaries, anagrams, etc) can be posted in /shareddata
Line 25: Line 25:
     * python     * python
     * perl (Hadoop::Streaming)     * perl (Hadoop::Streaming)
-    * R+RHadoop (rmr2, rhbase)+    * java (both Oracle in /usr/java and openJDK) 
 +    * R+RHadoop (rmr2, rhdfs, rhbase)
     * Hbase (noSQL database)     * Hbase (noSQL database)
 +      * [[http://whitetail.wesleyan.edu:60010|Master & Zookeepers]]
 +      * [[http://whitetail.wesleyan.edu:9095|Thrift server]]
     * MySQL      * MySQL 
 +      * request a database to be set up for you (limited space)
  
- +  * Note: the permissions are bit weird in HDFS but I think it is sorted out. 
 +    * If this turns into a problem we'll let everybody run as user hdfs ... 
 +  * Note: some http links will not work because they point to the private network 
 +    * If you wish to view them launch firefox from whitetail ...
  
 Other useful pages Other useful pages
Line 37: Line 43:
   * [[cluster:115|Use Hadoop Cluster]]   * [[cluster:115|Use Hadoop Cluster]]
  
 +\\ 
 +**[[cluster:0|Back]]**
cluster/121.txt · Last modified: 2013/09/16 11:09 by hmeij