User Tools

Site Tools


cluster:172

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
cluster:172 [2018/09/24 18:06]
hmeij07 [Finish]
cluster:172 [2018/09/25 12:30]
hmeij07 [Finish]
Line 37: Line 37:
 # download runfiles from https://developer.nvidia.com/cuda-downloads # download runfiles from https://developer.nvidia.com/cuda-downloads
 # files in /usr/local/src # files in /usr/local/src
-sh cuda_name_of_runfile +sh cuda_9.2.148_396.37_linux.run 
-sh cuda_name_of_runfile_patch+
  
 Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26? Install NVIDIA Accelerated Graphics Driver for Linux-x86_64 396.26?
Line 55: Line 55:
 (y)es/(n)o/(q)uit: n (y)es/(n)o/(q)uit: n
  
-# /etc/modprobe.d/blacklist-nouveau.conf+# /etc/modprobe.d/blacklist-nouveau.conf (new file by nvidia)
 # reboot before driver installation # CHROOT done # reboot before driver installation # CHROOT done
 blacklist nouveau blacklist nouveau
Line 62: Line 62:
  
 # nvidia driver # nvidia driver
-./cuda_name_of_runfile -silent -driver+./cuda_name_of_runfile \-\-silent \-\-accept-eula driver
  
 # backup # backup
Line 371: Line 371:
 To do another node, the steps are To do another node, the steps are
  
-  * add node in deploy.txt +  * add node in deploy.txtof n37.chroot/ 
-  * ./deploy.txt `grep nnode_name deploy.txt`+  * ./deploy.txt `grep node_name deploy.txt`
   * scp in place passwd, shadow, group, hosts, fstab from global archive   * scp in place passwd, shadow, group, hosts, fstab from global archive
   * umount -a   * umount -a
 +  * ONBOOT=no, ib0 ??? connectX mlx4_0 IB interface breaks in CentOS 7.3+
   * reboot then check polkit user … screws up systemd-logind   * reboot then check polkit user … screws up systemd-logind
-  * ifdown ib0 ??? connectX mlx4_0 IB interface breaks in CentOS 7.3+ +  * hostnamectl set-hostname node_name (logout/login) 
-  * hostnamectl set-hostname node_name+  * tar in place n37.chroot.ul.tar.gz in / FIRST 
 +  * REMOVE /usr/local/cuda-9.2 then 
 +  * Nvidia install: files in /usr/local/src 
 +    * sh cuda_name_of_runfile 
 +    * nvidia-modprobe.sh
  
  
 \\ \\
 **[[cluster:0|Back]]** **[[cluster:0|Back]]**
cluster/172.txt · Last modified: 2020/07/15 17:52 by hmeij07