Vultr DocsLatest Content

Associated Doc

How Does Vultr Kubernetes Engine Handle Scaling?

Updated on 20 November, 2025

Explains how Vultr Kubernetes Engine implements scaling capabilities through native Kubernetes features integrated with Vultr's infrastructure


Vultr Kubernetes Engine (VKE) builds on Kubernetes’ native scaling features, such as Horizontal Pod Autoscaling (HPA) and the Cluster Autoscaler, and integrates them with Vultr infrastructure.

On VKE, you can scale in two main ways:

  • Manual Node Pool Scaling: You can increase or decrease the number of nodes in a pool directly from the Vultr Customer Portal or via the API. This gives you precise control over cluster capacity.
  • Automatic Node Pool Scaling: VKE supports autoscaling for node pools. Based on workload demand, the autoscaler adjusts the number of nodes within a specified range. This ensures your applications remain responsive during peak traffic while reducing costs during periods of low activity.

Because node pools run on Vultr Compute instances, scaling decisions directly translate to provisioning or de-provisioning underlying nodes. This makes scaling transparent, predictable, and cost-efficient.

For a deeper dive into how Kubernetes itself manages scaling at node level, see How to Scale Node Pools in a Vultr Kubernetes Engine Cluster).