This shows you the differences between two versions of the page.
cluster:172 [2018/09/24 17:14] hmeij07 [Finish] |
cluster:172 [2018/09/25 18:04] hmeij07 [Finish] |
||
---|---|---|---|
Line 37: | Line 37: | ||
# download runfiles from https:// | # download runfiles from https:// | ||
# files in / | # files in / | ||
- | sh cuda_name_of_runfile | + | sh cuda_9.2.148_396.37_linux.run |
- | sh cuda_name_of_runfile_patch | + | |
Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? | Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? | ||
Line 55: | Line 55: | ||
(y)es/ | (y)es/ | ||
- | # / | + | # / |
# reboot before driver installation # CHROOT done | # reboot before driver installation # CHROOT done | ||
blacklist nouveau | blacklist nouveau | ||
Line 62: | Line 62: | ||
# nvidia driver | # nvidia driver | ||
- | ./ | + | ./ |
# backup | # backup | ||
Line 371: | Line 371: | ||
To do another node, the steps are | To do another node, the steps are | ||
- | * add node in deploy.txt | + | * add node in deploy.txt |
- | * ./ | + | * ./ |
- | * | + | * scp in place passwd, shadow, group, hosts, fstab from global archive |
- | copy passwd, shadow, group, hosts, fstab from global archive | + | * umount |
- | check polkit user … screws up systemd-logind | + | * ONBOOT=no, ib0 ??? connectX mlx4_0 IB interface breaks in CentOS 7.3+ |
- | connectX mlx4_0 IB interface breaks in CentOS 7.3+ | + | * bootlocal=EXIT then reboot then check polkit user … screws up systemd-logind |
- | unmount NFS mounts while installing nvidia as root | + | |
+ | * hostnamectl set-hostname node_name (logout/ | ||
+ | * eth1 on 129.133 | ||
+ | * rpm -i kernel-devel | ||
+ | * rpm -i / | ||
+ | * Nvidia install: files in / | ||
+ | * rpm -i cuda-repo-rhel7-10-0-local-10.0.130-410.48-1.0-1.x86_64.rpm | ||
+ | * yum clean all | ||
+ | * yum install cuda | ||
+ | |||
\\ | \\ | ||
**[[cluster: | **[[cluster: |
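The package steps added in the newer revision (set the hostname, install the local CUDA repo rpm, then `yum clean all` and `yum install cuda`) could be scripted roughly as below. This is a dry-run sketch, not the author's script: `run`, `provision_steps`, `DRY_RUN`, and `NODE_NAME` are hypothetical names, the rpm is assumed to sit in the current directory because the source path is truncated, and the kernel-devel and other `rpm -i` steps are left out because their arguments are elided in the diff.

```shell
#!/bin/sh
# Dry-run sketch of the per-node package steps listed in the newer revision.
# DRY_RUN and NODE_NAME are hypothetical variables, not from the original page.
DRY_RUN="${DRY_RUN:-1}"
NODE_NAME="${NODE_NAME:-node_name}"

# File name as given in the diff; its directory is truncated in the source,
# so this sketch assumes it is in the current working directory.
CUDA_REPO_RPM="cuda-repo-rhel7-10-0-local-10.0.130-410.48-1.0-1.x86_64.rpm"

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

provision_steps() {
    run hostnamectl set-hostname "$NODE_NAME"
    run rpm -i "$CUDA_REPO_RPM"
    run yum clean all
    run yum install cuda
}

provision_steps
```

With the default `DRY_RUN=1` the script only prints the four commands; setting `DRY_RUN=0` as root would execute them, and `yum install cuda` will then prompt for confirmation as usual.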