\\
**[[cluster:0|Back]]**
===== IPoIB =====
Redoing our RHEL5.5 HP Proliant blade servers with CentOS 6.7 using [[cluster:144|Warewulf Golden Image]] provisioning.
Not quite there yet, but I'll document here how Infiniband was installed. These compute nodes are connect to a Voltaire interconnect, and aging quite a bit.
First install a vanilla basic server and load packages you need. Then add the packages for Infiniband.
# Install and reboot
yum groupinstall "Infiniband Support"
yum install infiniband-diags perftest qperf opensm
chkconfig rdma on
# for openhpc
yum install inifinipath-psm
yum install opensm
chkconfig opensm on
yum install tcl tk
yum install infiniband-diags
shutdown -r now
# after reboot
lsmod | grep ib
# and the output (ipoib is the important one)
ib_ipoib 80391 0
ib_ucm 12121 0
ib_uverbs 39106 6 rdma_ucm,ib_ucm
ib_umad 11802 8
ib_cm 36996 3 ib_ipoib,ib_ucm,rdma_cm
mlx4_ib 137138 1
ib_sa 24060 5 ib_ipoib,rdma_ucm,rdma_cm,ib_cm,mlx4_ib
ib_mad 39811 4 ib_umad,ib_cm,mlx4_ib,ib_sa
ib_core 81507 11 ib_ipoib,rdma_ucm,ib_ucm,ib_uverbs,ib_umad,rdma_cm,ib_cm,iw_cm,mlx4_ib,ib_sa,ib_mad
ib_addr 8304 3 rdma_ucm,rdma_cm,ib_core
ipv6 335525 74 ip6t_REJECT,nf_conntrack_ipv6,nf_defrag_ipv6,ib_ipoib,mlx4_ib,ib_addr
mlx4_core 226123 2 mlx4_ib,mlx4_en
Connect the cable(s) to you ports and check that the reboot discovered the ports.
# Check IB ports
for i in `ls /sys/class/infiniband/*/ports/*/state`; do echo $i; cat $i; done
# Your output may vary
/sys/class/infiniband/mlx4_0/ports/1/state
4: ACTIVE
/sys/class/infiniband/mlx4_0/ports/2/state
1: DOWN
# some test commands
ibhosts
iblinkinfo
ibstatus
Edit ''/etc/sysconfig/network-scripts/ifcfg-ib0'', something like
DEVICE=ib0
TYPE=InfiniBand
UUID=eac9f00a-245d-4c88-b56f-1bcb6e6ed933
ONBOOT=yes
NM_CONTROLLED=no
BOOTPROTO=none
HWADDR=80:00:00:48:FE:80:00:00:00:00:00:00:00:02:C9:03:00:07:3B:DD
CONNECTED_MODE=no
IPADDR=10.11.103.31
PREFIX=16
DEFROUTE=node
IPV4_FAILURE_FATAL=yes
IPV6INIT=no
NAME="System ib0"
# then start interface
ifup ib0
# check the route, then mount /home on this interface
route
# Your output may vary
Kernel IP routing table
Destination Gateway Genmask Flags Metric Ref Use Iface
10.11.0.0 * 255.255.0.0 U 0 0 0 ib0
link-local * 255.255.0.0 U 1002 0 0 eth0
link-local * 255.255.0.0 U 1004 0 0 ib0
192.168.0.0 * 255.255.0.0 U 0 0 0 eth0
default greentail 0.0.0.0 UG 0 0 0 eth0
# home from sharptail
10.11.103.42:/home /home nfs defaults 0 0
\\
**[[cluster:0|Back]]**