This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision Next revision Both sides next revision | ||
cluster:172 [2018/08/22 11:58] hmeij07 [Nvidia] |
cluster:172 [2018/08/22 13:46] hmeij07 [Nvidia] |
||
---|---|---|---|
Line 17: | Line 17: | ||
* copy passwd, shadow, group, hosts, fstab from global archive | * copy passwd, shadow, group, hosts, fstab from global archive | ||
* check polkit user ... screws up systemd-logind | * check polkit user ... screws up systemd-logind | ||
- | * connextX | + | * connectX |
+ | * unmount NFS mounts while installing nvidia as root | ||
+ | * install other software as regular user | ||
==== Nvidia ==== | ==== Nvidia ==== | ||
+ | |||
+ | ** Installation ** | ||
< | < | ||
Line 29: | Line 33: | ||
yum update kernel kernel-tools kernel-tools-libs | yum update kernel kernel-tools kernel-tools-libs | ||
yum install kernel-devel kernel-headers (remove old headers after reboot) | yum install kernel-devel kernel-headers (remove old headers after reboot) | ||
- | yum install gcc gcc-devel gcc-gfortran gcc-gfortran-devel | + | yum install gcc gcc-devel gcc-gfortran gcc-c++ |
# download runfiles from https:// | # download runfiles from https:// | ||
Line 82: | Line 86: | ||
* export PATH=/ | * export PATH=/ | ||
* export LD_LIBRARY_PATH=/ | * export LD_LIBRARY_PATH=/ | ||
+ | |||
+ | **Verification** | ||
+ | |||
+ | < | ||
+ | |||
+ | [root@n37 cuda-9.2]# / | ||
+ | / | ||
+ | |||
+ | CUDA Device Query (Runtime API) version (CUDART static linking) | ||
+ | |||
+ | Detected 4 CUDA Capable device(s) | ||
+ | |||
+ | Device 0: "Tesla K20m" | ||
+ | CUDA Driver Version / Runtime Version | ||
+ | CUDA Capability Major/Minor version number: | ||
+ | ... | ||
+ | > Peer access from Tesla K20m (GPU0) -> Tesla K20m (GPU1) : Yes | ||
+ | > Peer access from Tesla K20m (GPU0) -> Tesla K20m (GPU2) : No | ||
+ | > Peer access from Tesla K20m (GPU0) -> Tesla K20m (GPU3) : No | ||
+ | > Peer access from Tesla K20m (GPU1) -> Tesla K20m (GPU0) : Yes | ||
+ | > Peer access from Tesla K20m (GPU1) -> Tesla K20m (GPU2) : No | ||
+ | > Peer access from Tesla K20m (GPU1) -> Tesla K20m (GPU3) : No | ||
+ | > Peer access from Tesla K20m (GPU2) -> Tesla K20m (GPU0) : No | ||
+ | > Peer access from Tesla K20m (GPU2) -> Tesla K20m (GPU1) : No | ||
+ | > Peer access from Tesla K20m (GPU2) -> Tesla K20m (GPU3) : Yes | ||
+ | > Peer access from Tesla K20m (GPU3) -> Tesla K20m (GPU0) : No | ||
+ | > Peer access from Tesla K20m (GPU3) -> Tesla K20m (GPU1) : No | ||
+ | > Peer access from Tesla K20m (GPU3) -> Tesla K20m (GPU2) : Yes | ||
+ | |||
+ | deviceQuery, | ||
+ | CUDA Runtime Version = 9.2, NumDevs = 4, | ||
+ | Device0 = Tesla K20m, Device1 = Tesla K20m, | ||
+ | Device2 = Tesla K20m, Device3 = Tesla K20m | ||
+ | Result = PASS | ||
+ | |||
+ | </ | ||
+ | |||
+ | ** BandWithTest ** | ||
+ | |||
+ | < | ||
+ | |||
+ | [root@n37 cuda-9.2]# / | ||
+ | [CUDA Bandwidth Test] - Starting... | ||
+ | Running on... | ||
+ | |||
+ | | ||
+ | Quick Mode | ||
+ | |||
+ | Host to Device Bandwidth, 1 Device(s) | ||
+ | | ||
+ | | ||
+ | | ||
+ | |||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | |||
+ | | ||
+ | | ||
+ | | ||
+ | | ||
+ | |||
+ | Result = PASS | ||
+ | |||
+ | </ | ||
+ | |||
+ | ** Finish ** | ||
+ | |||
+ | * yum install freeglut-devel libX11-devel libXi-devel libXmu-devel \ make mesa-libGLU-devel | ||
+ | * check for / | ||
+ | * [root@n37 /]# tar -cvf / | ||
+ | * [root@n37 /]# scp / | ||
+ | |||
+ | ==== Amber 16 ==== | ||
+ | |||
+ | < | ||
+ | # As root check requirements | ||
+ | rpm -qa | grep ^gcc | ||
+ | rpm -qa | grep ^g++ | ||
+ | rpm -qa | grep ^flex | ||
+ | rpm -qa | grep ^tcsh | ||
+ | rpm -qa | grep ^zlib | ||
+ | rpm -qa | grep ^zlib-devel | ||
+ | rpm -qa | grep ^bzip2 | ||
+ | rpm -qa | grep ^bzip2-devel | ||
+ | rpm -qa | grep ^bzip | ||
+ | rpm -qa | grep ^bzip-devel | ||
+ | rpm -qa | grep ^libXt | ||
+ | rpm -qa | grep ^libXext | ||
+ | rpm -qa | grep ^libXdmcp | ||
+ | rpm -qa | grep ^tkinter # weird one need python 2.6.6_something | ||
+ | rpm -qa | grep ^openmpi | ||
+ | rpm -qa | grep ^perl | egrep " | ||
+ | rpm -qa | grep ^patch | ||
+ | rpm -qa | grep ^bison | ||
+ | |||
+ | # As root install missing | ||
+ | yum install flex bzip2-devel libXdmcp zlib zlib-devel | ||
+ | yum install tkinter openmpi perl-ExtUtils-MakeMaker patch bison | ||
+ | |||
+ | </ | ||
+ | |||
\\ | \\ | ||
**[[cluster: | **[[cluster: |