This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:172 [2018/09/24 17:14] hmeij07 [Finish] |
cluster:172 [2018/09/26 13:47] hmeij07 |
||
---|---|---|---|
Line 34: | Line 34: | ||
yum install kernel-devel kernel-headers (remove old headers after reboot) | yum install kernel-devel kernel-headers (remove old headers after reboot) | ||
yum install gcc gcc-gfortran gcc-c++ | yum install gcc gcc-gfortran gcc-c++ | ||
+ | |||
+ | # / | ||
+ | # reboot before driver installation # CHROOT done | ||
+ | blacklist nouveau | ||
+ | options nouveau modeset=0 | ||
+ | |||
+ | # new kernel initramfs, load | ||
+ | dracut --force | ||
+ | |||
+ | reboot | ||
+ | |||
# download runfiles from https:// | # download runfiles from https:// | ||
# files in / | # files in / | ||
- | sh cuda_name_of_runfile | + | sh cuda_9.2.148_396.37_linux.run |
- | sh cuda_name_of_runfile_patch | + | |
Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? | Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? | ||
Line 54: | Line 65: | ||
Install the CUDA 9.2 Samples? | Install the CUDA 9.2 Samples? | ||
(y)es/ | (y)es/ | ||
- | |||
- | # / | ||
- | # reboot before driver installation # CHROOT done | ||
- | blacklist nouveau | ||
- | options nouveau modeset=0 | ||
- | reboot | ||
# nvidia driver | # nvidia driver | ||
./ | ./ | ||
+ | |||
+ | # Device files/ | ||
+ | # They were not | ||
+ | / | ||
# backup | # backup | ||
Line 74: | Line 83: | ||
[root@n37 src]# | [root@n37 src]# | ||
[root@n37 src]# scp n78:/ | [root@n37 src]# scp n78:/ | ||
- | |||
- | # Device files/ | ||
- | # They were not | ||
- | / | ||
- | |||
- | # new kernel initramfs, load | ||
- | dracut --force | ||
# for mapd graphics support needs to be enabled | # for mapd graphics support needs to be enabled | ||
Line 371: | Line 373: | ||
To do another node, the steps are | To do another node, the steps are | ||
- | * add node in deploy.txt | + | * add node in deploy.txt |
- | * ./ | + | * ./ |
- | * | + | * scp in place passwd, shadow, group, hosts, fstab from global archive |
- | copy passwd, shadow, group, hosts, fstab from global archive | + | * umount |
- | check polkit user … screws up systemd-logind | + | * ONBOOT=no, ib0 ??? connectX mlx4_0 IB interface breaks in CentOS 7.3+ |
- | connectX mlx4_0 IB interface breaks in CentOS 7.3+ | + | * bootlocal=EXIT then reboot then check polkit user … screws up systemd-logind |
- | unmount NFS mounts while installing nvidia as root | + | |
+ | * hostnamectl set-hostname node_name (logout/ | ||
+ | * eth1 on 129.133 | ||
+ | * yum update | ||
+ | * yum install kernel-headers kernel-devel | ||
+ | * put n37 tarball in /, unpack, remove cuda-9.2 | ||
+ | * reboot | ||
+ | |||
+ | * Nvidia install: files in / | ||
+ | * sh runfile | ||
+ | * reboot (nouveau) | ||
+ | * ./runfile -silent -driver | ||
+ | * reboot | ||
+ | |||
\\ | \\ | ||
**[[cluster: | **[[cluster: |