This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:172 [2018/09/24 19:04] hmeij07 |
cluster:172 [2018/09/25 14:39] hmeij07 [Finish] |
||
---|---|---|---|
Line 37: | Line 37: | ||
# download runfiles from https:// | # download runfiles from https:// | ||
# files in / | # files in / | ||
- | sh cuda_name_of_runfile | + | sh cuda_9.2.148_396.37_linux.run |
- | sh cuda_name_of_runfile_patch | + | |
Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? | Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? | ||
Line 369: | Line 369: | ||
- | To do another node, the steps are | + | To do another node, the steps are NOT WORKING! |
+ | Trying n36 with cuda rpm (local) | ||
- | * add node in deploy.txtof n37.chroot/ | + | * add node in deploy.txt of n37.chroot/ |
* ./ | * ./ | ||
* scp in place passwd, shadow, group, hosts, fstab from global archive | * scp in place passwd, shadow, group, hosts, fstab from global archive | ||
* umount -a | * umount -a | ||
* ONBOOT=no, ib0 ??? connectX mlx4_0 IB interface breaks in CentOS 7.3+ | * ONBOOT=no, ib0 ??? connectX mlx4_0 IB interface breaks in CentOS 7.3+ | ||
- | * reboot then check polkit user … screws up systemd-logind | + | * bootlocal=EXIT then reboot then check polkit user … screws up systemd-logind |
* hostnamectl set-hostname node_name (logout/ | * hostnamectl set-hostname node_name (logout/ | ||
- | * tar in place n37.chroot.ul.tar.gz in / | + | * eth1 on 129.133 |
+ | * rpm -i kernel-devel | ||
+ | * rpm -i / | ||
* Nvidia install: files in / | * Nvidia install: files in / | ||
* sh cuda_name_of_runfile | * sh cuda_name_of_runfile | ||
- | * sh cuda_name_of_runfile_patch | + | * nvidia-modprobe.sh |
\\ | \\ | ||
**[[cluster: | **[[cluster: |