White Papers

Table Of Contents
Dell HPC NFS Storage Solution - High Availability Configurations
Page 7
Server Redundancy
NSS-HA contains a pair of PowerEdge R710 servers. The two servers are configured in
active/passive mode using the Red Hat Cluster Suite which will be described in later sections.
In such a mode, when a server fails the other automatically takes over the service running on
the failed server. Thus a single server failure does not cause loss of service, although a brief
interruption (refer to Section 4.4) may occur while the failover is taking place.
Figure 2 - NSS-HA Configuration
Each PowerEdge R710 server has a Dell PERC H700 internal RAID controller and five local hard
disks. Two disks are configured in RAID 1 with one disk designated as a hot spare for the
operating system image. The two additional disks are configured in RAID 0 and used for swap
space. Each server has a dual port Dell 6Gbps SAS HBA to connect to the external PowerVault
MD3200 storage enclosure. Each server contains either an InfiniBand card or 10 Gigabit
Ethernet card to connect to the compute nodes. The servers have an iDRAC enterprise
management card for out-of-band systems management.
Power redundancy
Each server has dual power supplies. Each power supply in the server is connected to a
different power bus via a power PDU, which avoids a single point of power supply failure. The
configuration also includes two power PDUs for the two power supplies.