6.7
Table Of Contents
- vSphere Availability
- Contents
- About vSphere Availability
- Business Continuity and Minimizing Downtime
- Creating and Using vSphere HA Clusters
- Providing Fault Tolerance for Virtual Machines
- How Fault Tolerance Works
- Fault Tolerance Use Cases
- Fault Tolerance Requirements, Limits, and Licensing
- Fault Tolerance Interoperability
- Preparing Your Cluster and Hosts for Fault Tolerance
- Using Fault Tolerance
- Best Practices for Fault Tolerance
- Legacy Fault Tolerance
- Troubleshooting Fault Tolerant Virtual Machines
- Hardware Virtualization Not Enabled
- Compatible Hosts Not Available for Secondary VM
- Secondary VM on Overcommitted Host Degrades Performance of Primary VM
- Increased Network Latency Observed in FT Virtual Machines
- Some Hosts Are Overloaded with FT Virtual Machines
- Losing Access to FT Metadata Datastore
- Turning On vSphere FT for Powered-On VM Fails
- FT Virtual Machines not Placed or Evacuated by vSphere DRS
- Fault Tolerant Virtual Machine Failovers
- vCenter High Availability
- Plan the vCenter HA Deployment
- Configure the Network
- Configure vCenter HA With the Basic Option
- Configure vCenter HA With the Advanced Option
- Manage the vCenter HA Configuration
- Set Up SNMP Traps
- Set Up Your Environment to Use Custom Certificates
- Manage vCenter HA SSH Keys
- Initiate a vCenter HA Failover
- Edit the vCenter HA Cluster Configuration
- Perform Backup and Restore Operations
- Remove a vCenter HA Configuration
- Reboot All vCenter HA Nodes
- Change the Appliance Environment
- Collecting Support Bundles for a vCenter HA Node
- Troubleshoot Your vCenter HA Environment
- Patching a vCenter High Availability Environment
- Using Microsoft Clustering Service for vCenter Server on Windows High Availability
Host limits In addition to resource reservations, a virtual machine can only be placed
on a host if doing so does not violate the maximum number of allowed
virtual machines or the number of in-use vCPUs.
Feature constraints If the advanced option has been set that requires vSphere HA to enforce
VM to VM anti-affinity rules, vSphere HA does not violate this rule. Also,
vSphere HA does not violate any configured per host limits for fault tolerant
virtual machines.
If no hosts satisfy the preceding considerations, the master host issues an event stating that there are not
enough resources for vSphere HA to start the VM and tries again when the cluster conditions have
changed. For example, if the virtual machine is not accessible, the master host tries again after a change
in file accessibility.
VM and Application Monitoring
VM Monitoring restarts individual virtual machines if their VMware Tools heartbeats are not received
within a set time. Similarly, Application Monitoring can restart a virtual machine if the heartbeats for an
application it is running are not received. You can enable these features and configure the sensitivity with
which vSphere HA monitors non-responsiveness.
When you enable VM Monitoring, the VM Monitoring service (using VMware Tools) evaluates whether
each virtual machine in the cluster is running by checking for regular heartbeats and I/O activity from the
VMware Tools process running inside the guest. If no heartbeats or I/O activity are received, this is most
likely because the guest operating system has failed or VMware Tools is not being allocated any time to
complete tasks. In such a case, the VM Monitoring service determines that the virtual machine has failed
and the virtual machine is rebooted to restore service.
Occasionally, virtual machines or applications that are still functioning properly stop sending heartbeats.
To avoid unnecessary resets, the VM Monitoring service also monitors a virtual machine's I/O activity. If
no heartbeats are received within the failure interval, the I/O stats interval (a cluster-level attribute) is
checked. The I/O stats interval determines if any disk or network activity has occurred for the virtual
machine during the previous two minutes (120 seconds). If not, the virtual machine is reset. This default
value (120 seconds) can be changed using the advanced option das.iostatsinterval.
To enable Application Monitoring, you must first obtain the appropriate SDK (or be using an application
that supports VMware Application Monitoring) and use it to set up customized heartbeats for the
applications you want to monitor. After you have done this, Application Monitoring works much the same
way that VM Monitoring does. If the heartbeats for an application are not received for a specified time, its
virtual machine is restarted.
You can configure the level of monitoring sensitivity. Highly sensitive monitoring results in a more rapid
conclusion that a failure has occurred. While unlikely, highly sensitive monitoring might lead to falsely
identifying failures when the virtual machine or application in question is actually still working, but
heartbeats have not been received due to factors such as resource constraints. Low sensitivity monitoring
results in longer interruptions in service between actual failures and virtual machines being reset. Select
an option that is an effective compromise for your needs.
vSphere Availability
VMware, Inc. 16