ESX Virtualization

VMware ESXi, vSphere, VMware Backup, Hyper-V... how-to, videos....


HA agent on the host failed – what do the different possible messages mean?

By Vladan SEGET | Last Updated: February 15, 2017


I have long wanted to write down, for my own reference, the different states the HA agent can report when the HA agent on a host fails. If you find this elsewhere, don't worry: there are many VMware resources out there, written by many competent folks. For my own study purposes I need to have it written on my website. What do the different messages mean when one of the hosts inside an HA cluster fails? Depending on the failure scenario, different alert messages are shown. The HA agent states covered here are: Network Partitioned, Agent Unreachable, Host Failed, Network Isolated, Uninitialized and Initialization Error. While some of them are quite self-explanatory, I will try to focus on those which aren't so clear.

In my recent article How to configure VMware High Availability (HA) cluster, I walk you through the components of an HA cluster and its requirements, and show the steps necessary to configure HA. That article also covers the architecture around HA, which changed in vSphere 5.

HA agent on the host failed – Network partitioned state

Each FDM agent should normally be able to reach the FDM agents installed on the other hosts and vCenter, and at the same time be able to reach the default isolation address (usually the default gateway). In a network partition, the master host is unable to communicate with some hosts through the network. When the network connection fails and network heartbeats no longer arrive, a second communication channel, datastore heartbeats, is used to find out whether the host is dead or just partitioned.

A common case of network partition is a stretched cluster across two remote sites. If the WAN connection fails, the site where the master host is not present holds an election to choose its own master host. The master host in that site will mark the hosts from the other site as partitioned. After the connection between the two sites comes back, one of the two master hosts becomes a slave again and the host states change.

A special case is network isolation. It is a kind of network partition: a partition with only one host in it.
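To make the partition idea concrete, here is a minimal sketch in Python (not VMware code). It groups hosts by the management-network links that still work and picks one master per group. The host names, the `links` list and the election rule used here (most datastores wins, ties broken by the highest name) are assumptions for illustration only.

```python
from collections import defaultdict

def find_partitions(hosts, links):
    """Group hosts into sets that can still reach each other over the network."""
    neighbours = defaultdict(set)
    for a, b in links:
        neighbours[a].add(b)
        neighbours[b].add(a)
    seen, partitions = set(), []
    for host in hosts:
        if host in seen:
            continue
        # Depth-first walk over the links that are still working.
        stack, group = [host], set()
        while stack:
            current = stack.pop()
            if current in group:
                continue
            group.add(current)
            stack.extend(neighbours[current] - group)
        seen |= group
        partitions.append(group)
    return partitions

def elect_master(partition, datastore_count):
    """Assumed rule: most datastores wins, ties broken by highest host name."""
    return max(partition, key=lambda h: (datastore_count[h], h))

def masters_by_partition(hosts, links, datastore_count):
    """Return {master: sorted members} for every partition."""
    return {
        elect_master(group, datastore_count): sorted(group)
        for group in find_partitions(hosts, links)
    }

# Stretched cluster with the WAN link down: only intra-site links remain.
hosts = ["esx-a1", "esx-a2", "esx-b1", "esx-b2"]
links = [("esx-a1", "esx-a2"), ("esx-b1", "esx-b2")]
counts = {h: 2 for h in hosts}
print(masters_by_partition(hosts, links, counts))
```

With the WAN link down, each site ends up with its own master. A host with no working links at all forms a partition of one, which is the isolation case described above.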

One of the resolution paths for the Network Partitioned state can be to ensure that certificate checking is enabled in vCenter. See more details on problem resolution in this VMware KB: Host shows the vSphere HA status as Network Partitioned. To check that, go to:

  • Administration > vCenter Server Settings

But before changing that, make sure that you first deactivate HA on your cluster by deselecting its checkbox.

Host Failed State

  • The host failed, but apparently it can sometimes also mean that the host itself is running while its network failed.
  • The second communication channel (datastore heartbeats) failed as well.

VMs can still be running on local storage while the host has lost communication with its shared datastores and the network.

Network Isolated State

  • The host can be in the network isolated state if its HA agent cannot reach any of the HA agents running on the other hosts in the cluster.
  • The host is unable to ping the isolation address.
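The difference between the failed, partitioned and isolated states described so far can be summed up as a small decision function. This is only a sketch of the reasoning in this article, not the actual FDM logic. The function name and its four boolean inputs are hypothetical.

```python
from enum import Enum

class HostState(Enum):
    LIVE = "live"
    NETWORK_PARTITIONED = "network partitioned"
    NETWORK_ISOLATED = "network isolated"
    FAILED = "failed"

def classify_host(network_heartbeat, datastore_heartbeat,
                  reaches_other_agents, pings_isolation_address):
    """Classify a host from four observations (all booleans).

    network_heartbeat        master still receives network heartbeats
    datastore_heartbeat      host still updates its datastore heartbeat
    reaches_other_agents     host can reach at least one other HA agent
    pings_isolation_address  host can ping the isolation address
    """
    if network_heartbeat:
        return HostState.LIVE
    if not datastore_heartbeat:
        # Both communication channels are silent: treat the host as failed.
        return HostState.FAILED
    if not reaches_other_agents and not pings_isolation_address:
        # Alive, but alone: a partition with only one host in it.
        return HostState.NETWORK_ISOLATED
    return HostState.NETWORK_PARTITIONED

print(classify_host(False, True, True, True).value)
```

The order of the checks matters: the datastore heartbeat is only consulted once the network heartbeat is gone, which mirrors its role as the second communication channel.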

Agent Unreachable State

Several reasons:

  • vCenter is unable to contact the master host and the HA agent.
  • All servers in the cluster are in the failed state.
  • Unlikely, but possible: the watchdog process was unable to restart the HA agent on the host, because vCenter Server wasn't able to communicate with the agent on the host while HA was being disabled and re-enabled.

Uninitialized Error State

3 possible reasons:

  • Closed firewall ports (8182 – Traffic between hosts for vSphere High Availability – TCP and UDP)
  • All datastores failed (host can't access them)
  • The host does not have access to the datastore where the HA state information is stored.
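For the first reason in the list above, a quick way to see whether the TCP side of port 8182 is reachable from another machine is a plain socket connect. This is a minimal sketch: the function name is my own, and it only tests TCP, so a blocked UDP path will not be detected this way.

```python
import socket

HA_PORT = 8182  # vSphere HA traffic between hosts (TCP and UDP)

def tcp_port_open(host, port=HA_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution errors.
        return False
```

For example, `tcp_port_open("esxi01.example.local")` would return False if a firewall drops the connection (the host name here is hypothetical).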

Initialization Error State

This error shows that the last attempt to configure HA failed. vSphere HA does not monitor the state of the VMs on this host.

This small article is not meant to be exhaustive and is provided as is. If you detect any obvious errors, just ping me through Twitter: www.vladan.fr/twitter

Literature and sources:

  • vSphere Troubleshooting (VMware PDF)
  • VMware Community Forums
  • vSphere Availability Guide – section “How vSphere HA works”
  • Deep Dive on Duncan's blog
  • Troubleshooting VMware High Availability (HA) (1001596).

Other Sources:

  • Comparing VMware HA 4.x and vSphere HA 5.0 (2004401).

Just a reminder, the FDM log can be found here: /var/log/fdm.log
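If you copy that log off the host, a few lines of Python are enough to pull out the lines worth reading. This is a generic keyword filter, not a parser for the real FDM log format. The helper name and the keywords are assumptions, so adjust them to what you actually see in your log.

```python
def matching_lines(lines, keywords):
    """Return the lines that contain any of the keywords (case-insensitive)."""
    lowered = [k.lower() for k in keywords]
    return [
        line.rstrip("\n")
        for line in lines
        if any(k in line.lower() for k in lowered)
    ]
```

Typical use would be to open a copy of /var/log/fdm.log, pass the file object as `lines`, and search for words such as "isolated" or "partition".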


| Filed Under: Server Virtualization Tagged With: HA agent on the host failed

About Vladan SEGET

This website is maintained by Vladan SEGET. Vladan is an independent consultant, professional blogger, vExpert x16, Veeam Vanguard x9 and VCAP-DCA/DCD. The ESX Virtualization site started as a simple bookmarking site, but quickly found a large following of readers and subscribers.

Connect on: Facebook. Feel free to network via Twitter @vladan.

Comments

  1. DIENG says

    May 31, 2016 at 11:39 pm

    Hello, I have a small problem with enabling HA in my cluster.
    Let me explain: I have two remote (HP) servers with an EMC² SAN. They are in two geographically different locations (but interconnected, of course).
    The two servers can ping and see each other correctly, and can see the SAN as well.
    The first time I enabled HA, it went normally and the VMs restarted correctly on the other server without any problem at all. One day, I changed the IP address of Server1. HA still worked without a problem. Now, when I disabled HA in order to re-enable it, I got errors once it reached 46-48%. I don't understand it at all, and for a week I have not been able to solve the problem. I reboot and everything, but nothing happens and I am completely stuck.

    I keep seeing these errors during activation:

    – Cannot install the vCenter Server agent service. Unknown installer error
    – An error occurred while communicating with the remote host
    – Unable to communicate with the remote host, since it is disconnected
    – vCenter Server is unable to find a master vSphere HA agent in cluster (cluster name)
    – vSphere HA agent for this host has an error : vSphere HA agent cannot be installed or configured

    I really need your help.
    Thanks in advance

    • Vladan SEGET says

      June 1, 2016 at 7:24 am

      Try disconnecting and reconnecting the host(s): right-click > Disconnect, then right-click > Connect. This updates the vCenter DB and fixes the broken connection to the host.
