ESX Virtualization

VMware ESXi, vSphere, VMware Backup, Hyper-V... how-to, videos....

Nakivo Backup and Replication - #1 Backup solution for Virtual, physical, cloud, NAS and SaaS

Menu
  • Certification
      • VCP-DCV vSphere 8
          • vcp2024-125.
        • Close
    • Close
  • VMware
    • Configuration Maximums
    • vSphere
      • vSphere 8.0
      • vSphere 7.0
      • vSphere 6.7
      • vSphere 6.5
      • vSphere 6.0
      • Close
    • VMworld
      • VMware EXPLORE 2024
      • VMware EXPLORE 2023
      • VMware EXPLORE 2022
      • VMworld 2019
      • VMworld 2018
      • VMworld 2017
      • VMworld 2016
      • VMworld 2015
      • VMworld 2014
      • VMworld 2013
      • VMworld 2012
      • VMworld 2011
      • Close
    • Close
  • Microsoft
    • Windows Server 2012
    • Windows Server 2016
    • Windows Server 2019
    • Close
  • Categories
    • Tips – VMware, Microsoft and General IT tips and definitions, What is this?, How this works?
    • Server Virtualization – VMware ESXi, ESXi Free Hypervizor, VMware vSphere Server Virtualization, VMware Cloud and Datacenter Virtualization
    • Backup – Virtualization Backup Solutions, VMware vSphere Backup and ESXi backup solutions.
    • Desktop Virtualization – Desktop Virtualization, VMware Workstation, VMware Fusion, VMware Horizon View, tips and tutorials
    • How To – ESXi Tutorials, IT and virtualization tutorials, VMware ESXi 4.x, ESXi 5.x and VMware vSphere. VMware Workstation and other IT tutorials.
    • Free – Free virtualization utilities, ESXi Free, Monitoring and free backup utilities for ESXi and Hyper-V. Free IT tools.
    • Videos – VMware Virtualization Videos, VMware ESXi Videos, ESXi 4.x, ESXi 5.x tips and videos.
    • Home Lab
    • Reviews – Virtualization Software and reviews, Disaster and backup recovery software reviews. Virtual infrastructure monitoring software review.
    • Close
  • Partners
    • NAKIVO
    • StarWind
    • Zerto
    • Xorux
    • Close
  • This Web
    • News
    • ESXi Lab
    • About
    • Advertise
    • Archives
    • Disclaimer
    • PDFs and Books
    • Close
  • Free
  • Privacy policy

VMware VSAN Resynchronization and Rebalance Operations – New PDF

By Vladan SEGET | Last Updated: February 21, 2020

Shares

There is new PDF from VMware targeted for VSAN users, architects and IT admins. Recent change in VMware HCL concerning the queue depth of storage adapters (now there are only the ones with queue depth 256 and higher – see the ones are concerned in this VMware KB ) brought more questions concerning availability and performance during resynchronization and rebalance operations.

The paper is called VMware Virtual SAN Data Management Operations and it is a good read because it explains few things that I think many of the folks out there aren't aware. The capacity and performance of VSAN I think that everyone understood so far, how it works during normal operations when everything goes fine.

But when component failure occurs the song is different, because if the storage controller card has queue depth lower than 256 during the resynchronization or rebalance operations the running VMs might experience timeouts and become unresponsive, because the storage controller gets overloaded. Even though VSAN has built-in throttling mechanism which is ON during those operations.

From the HCL update blog post:

Although Virtual SAN has a built-in throttling mechanism for rebuild operations, it is designed to make minimal progress in order to avoid Virtual SAN objects from being exposed to double component failures for a long time. In configurations with low queue-depth controllers, even this minimal progress can cause the controllers to get saturated, leading to high latency and IO time outs.

Given the above scenario, VMware has decided to remove controllers with queue depth of less than 256 from the Virtual SAN compatibility list. While fully functional, these controllers offer too low IO throughput to sustain the performance requirements of most VMware environments.

Back to the paper. This is new for me and I haven't read about it elsewhere. There are two types of hardware failures which can occur in VSAN cluster and depending of the component, VSAN will react differently in each particular case.

Two types of hardware failures can occur in VSAN cluster:

01. Absent Failure – physical network, network card (pNIC), host failure – 60 min delay (configurable parameter ClomRepairDelay – VMware KB 2075456 ) and then resync.

02. Degraded Failure – Magnetic Disk, Flash Device, Storage Controller – immediate resync non configurable!

VMware has in mind that the degraded failure has to do with elements which won't come back online any time soon. Disk drive if becomes unresponsive, then it's because it is dead. It completely makes sense.

Now you see how important is to have storage controller with queue depth higher than 256, but there is also another value to take into account. Which one? Cluster size.

Because the minimum size of VSAN cluster is 3 hosts only, the situation when there is a component failure will affect rebuild as the rebuild will simply won't happens! See the screenshot from the paper.

When a host failure occurs, the remaining two hosts are excluded from the resynchronization operation because the remaining nodes are already hosting objects and components for the affected virtual machines.

That's why it completely makes sense to build clusters with 4 hosts and more.

VSAN Rebalance and Resynchronization

So in this particular case there is no syncing until:

1. The host come back online
2. Another host is added to the VSAN cluster.

VMware Recommendations for VSAN?

To lower resync and rebalance times is best to use 10GbE and storage adapters with queue depth higher than 256 (now mandatory). From the cluster architecture perspective is recommended to create larger clusters with more hosts and more disk groups which allows to lower rebalance and rebuild times as the whole cluster acts as a hot spare disk.

Certainly interesting. The document is available as PDF and I can highly recommend to download as it clears some doubts and gives more arguments when talking about VSAN availability, performance during rebalance and resync operations.

Queue depth of 256 or higher prevents the controller’s IO queue from being overloaded, thus limiting IO operation time outs, high latency, and unresponsive virtual machines.

The author of the paper is Rawlinson Rivera (@punchingclouds).

Source and download: VMware vSphere Blog

Shares
5/5 - (1 vote)

| Filed Under: Server Virtualization Tagged With: VSAN rebalance, VSAN resynchronization

About Vladan SEGET

This website is maintained by Vladan SEGET. Vladan is as an Independent consultant, professional blogger, vExpert x16, Veeam Vanguard x9, VCAP-DCA/DCD, ESX Virtualization site has started as a simple bookmarking site, but quickly found a large following of readers and subscribers.

Connect on: Facebook. Feel free to network via Twitter @vladan.

Comments

  1. masood says

    December 25, 2017 at 1:16 pm

    HI Dear Vladan

    i have 10G network adapter on my vsan but vsan resyncing is very low (for 123GB estimate time is 1 day)
    would you help me plz

  2. Jose says

    June 6, 2020 at 8:39 pm

    Hi, if you have a cluster with for instance 10 ESXi, policy RAID-1 with FTT=1, a lot of free space and fail one host, the component will be rebuilt in another host of the cluster. In this case, is it possible another ESXi fail in the cluster? Thanks

    • Vladan SEGET says

      June 7, 2020 at 7:53 am

      Check the latest VSAN documentation. The post might be outdated. Once the rebuilding process finishes. Yes.

Private Sponsors

Featured

  • Thinking about HCI? G2, an independent tech solutions peer review platform, has published its Winter 2023 Reports on Hyperconverged Infrastructure (HCI) Solutions.
  • Zerto: One Platform for Disaster Recovery, Backup & Cloud Mobility: Try FREE Hands-On Labs Today!
Click to Become a Sponsor

Most Recent

  • FREE version of StarWind VSAN vs Trial of Full version
  • Commvault’s Innovations at RSA Conference 2025 San Francisco
  • VMware ESXi FREE is FREE again!
  • Installation of StarWind VSAN Plugin for vSphere
  • Protect Mixed environments with Nakivo Physical Machine recovery (bare metal)
  • No more FREE licenses of VMware vSphere for vExperts – What’s your options?
  • Tails – Your Private OS on USB Stick
  • StarWind V2V Converter Now has CLI
  • Veeam VHR ISO v2 – 2025 Download and Install
  • Deployment OVA and Installation of Nakivo Backup and Replication for VMware

Get new posts by email:

 

 

 

 

Support us on Ko-Fi

 

 

Buy Me a Coffee at ko-fi.com

Sponsors

Free Trials

  • DC Scope for VMware vSphere – optimization, capacity planning, and cost management. Download FREE Trial Here.
  • Augmented Inline Deduplication, Altaro VM Backup v9 For #VMware and #Hyper-V – Grab your copy now download TRIAL.

VMware Engineer Jobs

VMware Engineer Jobs

YouTube

…

Find us on Facebook

ESX Virtualization

…

Copyright © 2025 ·Dynamik-Gen · Genesis Framework · Log in