This shows you the differences between two versions of the page.
cluster:171 [2018/08/20 13:03] hmeij07 |
cluster:171 [2018/08/20 13:55] (current) hmeij07 [Step 5] |
||
---|---|---|---|
Line 135: | Line 135: | ||
==== Step 4 ==== | ==== Step 4 ==== | ||
- | Copy the deploy.[sh|txt] files from centos7.chroot to vanilla.chroot, see [[cluster: | + | Create the deploy.[sh|txt] files (attached at bottom of page, I keep them in CHROOT/ |
+ | |||
+ | Ran into UEFI boot loader problems. More on that later. | ||
< | < | ||
# final touches in CHROOT | # final touches in CHROOT | ||
- | cp / | ||
- | |||
# edit deploy.sh (check filesystems, | # edit deploy.sh (check filesystems, | ||
+ | |||
+ | # copy your user and group base in passwd/ | ||
+ | # copy or edit bashrc and fstab in CHROOT/etc, add your NFS mounts | ||
# version | # version | ||
Line 157: | Line 160: | ||
vanilla.chroot | vanilla.chroot | ||
- | + | # configure node, deploy | |
- | # configure node, done on WW 3.6.99 | + | |
# assumes node pxe boots first | # assumes node pxe boots first | ||
+ | |||
cd / | cd / | ||
./deploy.sh `grep ^n37 deploy.txt` | ./deploy.sh `grep ^n37 deploy.txt` | ||
- | |||
ssh n37 reboot | ssh n37 reboot | ||
+ | # once the deploy is on its way, imaging might take 5 mins or so | ||
+ | # for next node boot to be from local disk, on SMS_server issue | ||
- | # once the deploy is on it's way, imaging might take 5 mins or so | ||
- | # on SMS_server issue | ||
wwsh provision set --bootlocal=EXIT n37 -y | wwsh provision set --bootlocal=EXIT n37 -y | ||
- | |||
- | # test a reboot of node n37 from local disk | ||
</ | </ | ||
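The deploy call in this step selects the node's line with `grep ^n37 deploy.txt`, which would also match a node named, say, n370 if one were ever listed. A minimal sketch of a stricter lookup follows; the helper name `pick_node` is hypothetical, and it assumes only that each node entry in deploy.txt starts with the node name as its first whitespace-separated field (the remaining fields are not shown in full on this page, so they are passed through untouched).

```shell
#!/bin/sh
# Hypothetical helper, not part of deploy.sh or Warewulf.
# Prints the first line of FILE whose first field is exactly NODE.
# Comment lines (first field "#") never match a node name.
pick_node() {
    node="$1"
    file="$2"
    awk -v n="$node" '$1 == n { print; exit }' "$file"
}
```

It could then stand in for the backtick grep, for example `./deploy.sh $(pick_node n37 deploy.txt)`, leaving the rest of the procedure unchanged.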
Line 177: | Line 177: | ||
==== Step 5 ===== | ==== Step 5 ===== | ||
- | So after imaging and reboot, what do we have? Definitely an imaged node, the partitions have shuffled. And our VERSION file came from the vnfs made from CHROOT. We also have eth0, eth1 and ib0. | + | So after imaging and reboot, what do we have? Definitely an imaged node, the partitions have shuffled |
< | < | ||
Line 193: | Line 193: | ||
/ | / | ||
- | [root@n37 | + | [root@n37 |
Infiniband hardware address can be incorrect! Please read BUGS section in ifconfig(8). | Infiniband hardware address can be incorrect! Please read BUGS section in ifconfig(8). | ||
+ | eth0: flags=4163< | ||
inet 192.168.102.47 | inet 192.168.102.47 | ||
+ | eth1: flags=4163< | ||
inet 10.10.102.47 | inet 10.10.102.47 | ||
+ | ib0: flags=4163< | ||
inet 10.11.103.47 | inet 10.11.103.47 | ||
+ | lo: flags=73< | ||
inet 127.0.0.1 | inet 127.0.0.1 | ||
+ | |||
</ | </ | ||
- | Before the reboot we grabbed passwd/ | + | Then I have a post boot script |
< | < | ||
+ | # make sure my mounts are all ok | ||
[root@n37 ~]# mount -a | [root@n37 ~]# mount -a | ||
- | [root@n37 | + | # staged kernel rpms |
- | [root@n37 rpms]# yum install --tolerant *3.10.0* | + | [root@n37 |
+ | [root@n37 rpms]# yum install --tolerant | ||
... | ... | ||
Line 224: | Line 231: | ||
... | ... | ||
- | # reboot from local disk, do not make a golden image of this, stay on 7.2 till 7.5 | + | # reboot from local disk, do not make a golden image of this |
- | # is fixed, only add non-kernel packages to vanilla.chroot if needed and re-image | + | |
</ | </ | ||
Line 231: | Line 237: | ||
- | As far as I understand it, from CentOS 7.3 and on OHPC/ | + | As far as I understand it, from CentOS 7.3 and higher versions, |
* efibootmgr efivar-libs grub2-efi-x64 dosfstools | * efibootmgr efivar-libs grub2-efi-x64 dosfstools | ||
- | My hardware is an ASUS esc4000fdr G2 bought in 2013. In the BIOS boot CSM menus I can set boot filter options to "UEFi and Legacy" | + | My hardware is an ASUS esc4000fdr G2 bought in 2013. In the BIOS boot CSM menus I can set boot filter options to "UEFi and Legacy" |
- | But the CentOS 7.5 image...not. No matter what settings I use to boot UEFI I continue to receive EFI variables not supported. If I boot the 7.5 image Legacy | + | But if I build a CentOS 7.5 image...not. No matter what settings I use to boot UEFI I continue to receive |
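When chasing the "EFI variables not supported" message, it can help to confirm how the running kernel was actually booted before changing BIOS settings again. A minimal sketch, assuming a Linux node with sysfs mounted: the kernel exposes the directory /sys/firmware/efi only when it was started through UEFI. The function name `boot_mode` and its optional sysfs-root argument are hypothetical and exist only to make the check easy to test.

```shell
#!/bin/sh
# Hypothetical helper: report whether the running system booted via UEFI.
# Prints "uefi" if <sysroot>/firmware/efi exists, otherwise "legacy".
# The sysfs root defaults to /sys; the argument is only for testing.
boot_mode() {
    sysroot="${1:-/sys}"
    if [ -d "$sysroot/firmware/efi" ]; then
        echo uefi
    else
        echo legacy
    fi
}
```

Running `boot_mode` on the node (or in the provisioning environment) shows which mode is in effect; if it prints `legacy`, tools that need EFI variables cannot work regardless of which packages are in the image.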
+ | |||
+ | The owner of the list thread posted this solution | ||
Line 247: | Line 255: | ||
[[ -n " | [[ -n " | ||
chroot " | chroot " | ||
+ | |||
+ | # Read the OpenHPC thread " | ||
</ | </ | ||
Line 314: | Line 324: | ||
</ | </ | ||
+ | |||
+ | ==== deploy.txt ==== | ||
+ | |||
+ | < | ||
+ | |||
+ | # n33.chroot type nodes, ASUS type servers with 4x K20 gpus | ||
+ | n37 50: | ||
+ | # more servers ... | ||
+ | </ | ||
\\ | \\ | ||
**[[cluster: | **[[cluster: |