A document for me to recall and make notes of what I read in the manual pages and what needs testing.

Basically, during the Summer of 2016 I investigated whether the HPCC could afford enterprise-level storage. I wanted 99.999% uptime, snapshots, high availability and other goodies such as parallel NFS. Netapp came the closest but, eh, at $42K lots of other options show up. That story is detailed at [[cluster:...]].

This page is best read from the bottom up.

==== beeGFS cluster idea ====

  * Storage servers:
    * buy 2, each with 12x2TB slow disks, Raid 6, 20T usable (clustered, parallel file system)
    * create 2 6TB volumes on each, quota at 2TB via XFS (see the quota sketch after this list), 3 users/volume
    * only $HOME changes to ''/...''
    * create 2 buddymirrors; ...
    * on UPS
    * on Infiniband

  * Client servers:
    * all compute/... nodes

  * Meta servers:
    * cottontail2 (root meta, on Infiniband) plus n38-n45 nodes (on Infiniband)
    * all mirrored (total=9)
    * cottontail2 on UPS

  * Management and Monitor servers:
    * cottontail (on UPS, on Infiniband)

  * Backups (rsnapshot.org via rsync daemons [[cluster:...]]; see the backup sketch after this list)
    * sharptail:/...
    * serverA:/mnt/...
    * serverB:/...

  * Costs (includes 3 year NBD warranty)
    * Microway $12,500
    * CDW ...
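
Rough sketch of the XFS quota idea above (device, mount point and user are placeholders I made up):

<code>
# mount the storage volume with user quotas enabled (uquota in fstab)
mount -o uquota /dev/sdb1 /data/beegfs_storage1

# cap a user at a 2TB hard block limit
xfs_quota -x -c 'limit bhard=2t someuser' /data/beegfs_storage1

# check usage against the limits
xfs_quota -x -c 'report -h' /data/beegfs_storage1
</code>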
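
And a rough sketch of the backup idea, rsnapshot pulling via an rsync daemon (module name and paths made up):

<code>
# /etc/rsyncd.conf on a storage server (hypothetical module)
[home1]
    path = /mnt/beegfs/home1
    read only = yes

# /etc/rsnapshot.conf line on the backup host (fields must be tab separated)
backup	rsync://serverA/home1/	serverA_home1/

# run a rotation
rsnapshot daily
</code>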
==== beegfs-admin-gui ====

  * ''...''
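
To launch it, just a Java invocation; the jar name below follows this section's title and is an assumption, not verified:

<code>
# the GUI is a plain Java application (jar name/location assumed)
java -jar beegfs-admin-gui.jar
</code>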
==== upgrade ====

  * [[http://...]]
  * New feature - High Availability for Metadata Servers (self-healing, ...)
==== Resync Data #2 ====

If you have 2 buddymirrors and 2 storage servers, each with 2 storage objects, beegfs will write to all primary storage targets even if numtargets is set to 1; it will use all storage objects, so best to set numtargets accordingly.

How does one add a server?

<code>

# define storage objects, 2 per server
[root@petaltail ~]# /opt/beegfs/sbin/beegfs-setup-storage ...
[root@petaltail ~]# /opt/beegfs/sbin/beegfs-setup-storage ...
[root@swallowtail data]# /opt/beegfs/sbin/beegfs-setup-storage ...
[root@swallowtail data]# /opt/beegfs/sbin/beegfs-setup-storage ...

[root@cottontail2 ~]# beegfs-df
METADATA SERVERS:
TargetID ...
======== ...
     250 ...

STORAGE TARGETS:
TargetID ...
======== ...
   13601 ...
   13602 ...
   21701 ...
   21702 ...
+ | |||
+ | # define mirrrogroups | ||
+ | [root@cottontail2 ~]# beegfs-ctl --addmirrorgroup --primary=21701 --secondary=13601 --groupid=1 | ||
+ | [root@cottontail2 ~]# beegfs-ctl --addmirrorgroup --primary=13602 --secondary=21702 --groupid=2 | ||
+ | |||
+ | [root@cottontail2 ~]# beegfs-ctl --listmirrorgroups | ||
+ | | ||
+ | | ||
+ | 1 | ||
+ | 2 | ||
+ | |||
+ | # define buddygroups, | ||
+ | [root@cottontail2 ~]# beegfs-ctl --setpattern --buddymirror / | ||
+ | New chunksize: 524288 | ||
+ | New number of storage targets: 1 | ||
+ | Path: /home1 | ||
+ | Mount: / | ||
+ | |||
+ | [root@cottontail2 ~]# beegfs-ctl --setpattern --buddymirror / | ||
+ | New chunksize: 524288 | ||
+ | New number of storage targets: 1 | ||
+ | Path: /home2 | ||
+ | Mount: / | ||
+ | |||
+ | # drop /home/hmeij in / | ||
+ | [root@petaltail mysql_bak_ptt]# | ||
+ | 3623 | ||
+ | [root@petaltail mysql_bak_ptt]# | ||
+ | 3678 | ||
+ | [root@swallowtail data]# find / | ||
+ | 3623 | ||
+ | [root@swallowtail data]# find / | ||
+ | 3678 | ||
+ | |||
+ | # with numtargets=1 beegfs still writes to all primary targets found in all buddygroups | ||
+ | |||
+ | # rebuild test servers with from scratch with numparts=2 | ||
+ | # drop hmeij/ into home1/ and obtain slightly more files (couple of 100s), not double the amount | ||
+ | # /home/hmeij has 7808 files in it which gets split over primaries but numparts=2 would yield 15,616 files? | ||
+ | # drop another copy in home2/ and file counts double to circa 7808 | ||
[root@cottontail2 ~]# beegfs-ctl --getentryinfo /mnt/beegfs/home1
Path: /home1
Mount: /mnt/beegfs
EntryID: 0-583C50A1-FA
Metadata node: cottontail2 [ID: 250]
Stripe pattern details:
+ Type: Buddy Mirror
+ Chunksize: 512K
+ Number of storage targets: desired: 2

[root@cottontail2 ~]# beegfs-ctl --getentryinfo /mnt/beegfs/home2
Path: /home2
Mount: /mnt/beegfs
EntryID: 1-583C50A1-FA
Metadata node: cottontail2 [ID: 250]
Stripe pattern details:
+ Type: Buddy Mirror
+ Chunksize: 512K
+ Number of storage targets: desired: 2

Source: /home/hmeij 7808 files in 10G

TargetID ...
======== ...
   13601 ...
   13602 ...
   21701 ...
   21702 ...

[root@cottontail2 ~]# rsync -ac --bwlimit=2500 /home/hmeij /mnt/beegfs/home1/
[root@cottontail2 ~]# rsync -ac --bwlimit=2500 /home/hmeij /mnt/beegfs/home2/

TargetID ...
======== ...
   13601 ...
   13602 ...
   21701 ...
   21702 ...
+ | |||
+ | # first rsync drops roughly 5G in both primaries which then get copied to secondaries. | ||
+ | # second rsync does the same so both storage servers loose 20G roughly | ||
+ | # now shut a storage server down and the whole filesystem can still be accessed (HA) | ||
+ | |||
+ | </ | ||
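
To watch that failover from the admin side, ''beegfs-ctl --listtargets'' can report per-target reachability and consistency; a sketch, not output from this cluster:

<code>
# show reachability (Online/Offline) and consistency (Good/Needs-resync) per storage target
beegfs-ctl --listtargets --nodetype=storage --state

# show which targets form which buddy groups
beegfs-ctl --listtargets --mirrorgroups
</code>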
+ | |||
+ | ==== Resync Data #1 ==== | ||
[[http:// | [[http:// | ||
Line 36: | Line 170: | ||
* started a full --resyncstorage --mirrorgroupid=101 --timestamp=0 | * started a full --resyncstorage --mirrorgroupid=101 --timestamp=0 | ||
* got --getentryinfo EntryID for a file in my / | * got --getentryinfo EntryID for a file in my / | ||
- | * did a cat /mnt.beegfs/ | + | * did a cat /mnt/beegfs/ |
* brought primary storage down | * brought primary storage down | ||
* redid the cat above (it hangs for a couple of minutes, then displays the file content) | * redid the cat above (it hangs for a couple of minutes, then displays the file content) | ||
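
Those steps as commands, reconstructed; the test file path is made up and the ''--resyncstoragestats'' progress call is an assumption for this release:

<code>
# kick off a full resync of buddy group 101, ignoring modification timestamps
beegfs-ctl --resyncstorage --mirrorgroupid=101 --timestamp=0

# watch resync progress (command name assumed)
beegfs-ctl --resyncstoragestats --mirrorgroupid=101

# look up the EntryID of a test file, then read it before and after
# taking the primary storage server down
beegfs-ctl --getentryinfo /mnt/beegfs/testfile
cat /mnt/beegfs/testfile
</code>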

==== Installation ====

  * made easy [[http://...]]
  * rpms pulled from repository via petaltail in ''...''
  * ''...''
  * use ''...''
<code>