6.7

Table Of Contents

The Primary and Secondary VMs continuously monitor the status of one another to ensure that Fault

Tolerance is maintained. A transparent failover occurs if the host running the Primary VM fails, in which

case the Secondary VM is immediately activated to replace the Primary VM. A new Secondary VM is

started and Fault Tolerance redundancy is reestablished automatically. If the host running the Secondary

VM fails, it is also immediately replaced. In either case, users experience no interruption in service and no

loss of data.

A fault tolerant virtual machine and its secondary copy are not allowed to run on the same host. This

restriction ensures that a host failure cannot result in the loss of both VMs.

Note You can also use VM-Host affinity rules to dictate which hosts designated virtual machines can run

on. If you use these rules, be aware that for any Primary VM that is affected by such a rule, its associated

Secondary VM is also affected by that rule. For more information about affinity rules, see the vSphere

Resource Management documentation.

Fault Tolerance avoids "split-brain" situations, which can lead to two active copies of a virtual machine

after recovery from a failure. Atomic file locking on shared storage is used to coordinate failover so that

only one side continues running as the Primary VM and a new Secondary VM is respawned automatically.

vSphere Fault Tolerance can accommodate symmetric multiprocessor (SMP) virtual machines with up to

four vCPUs.

Fault Tolerance Use Cases

Several typical situations can benefit from the use of vSphere Fault Tolerance.

Fault Tolerance provides a higher level of business continuity than vSphere HA. When a Secondary VM is

called upon to replace its Primary VM counterpart, the Secondary VM immediately takes over the Primary

VM’s role with the entire state of the virtual machine preserved. Applications are already running, and

data stored in memory does not need to be reentered or reloaded. Failover provided by vSphere HA

restarts the virtual machines affected by a failure.

This higher level of continuity and the added protection of state information and data informs the

scenarios when you might want to deploy Fault Tolerance.

Applications which must always be available, especially applications that have long-lasting client

connections that users want to maintain during hardware failure.

Custom applications that have no other way of doing clustering.

Cases where high availability might be provided through custom clustering solutions, which are too

complicated to configure and maintain.

Another key use case for protecting a virtual machine with Fault Tolerance can be described as On-

Demand Fault Tolerance. In this case, a virtual machine is adequately protected with vSphere HA during

normal operation. During certain critical periods, you might want to enhance the protection of the virtual

machine. For example, you might be running a quarter-end report which, if interrupted, might delay the

vSphere Availability

VMware, Inc. 49