One of Ocean’s key features for optimizing scaling is headroom, a buffer of spare compute capacity that ensures a cluster is always ready for a rapid application scale up.
When you configure headroom in specific amounts of resources (i.e., vCPU, memory, and GPU), or specify headroom as a percentage of the cluster’s total requested resources, the cluster can scale workloads without waiting for new instances to be provisioned.
Ocean manages headroom dynamically to provide the best possible cost-performance balance. Alternatively, you can configure headroom manually, or schedule it to support workloads that have peaks.
Automatic headroom is dynamic and is sized to accommodate the next scale-up of services in the cluster. It is configured at the Ocean cluster level or the VNG level as a percentage of the resources. Ocean recalculates automatic headroom hourly and adjusts it to the amount of resources required in the cluster.
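As a rough illustration of percentage-based headroom, the following Python sketch computes a buffer from a cluster's total requested resources. The `Resources` type, the `automatic_headroom` function, the units (millicores, MiB), and the round-down behavior are all assumptions made for this example. They are not part of the Ocean API and may not match how Ocean calculates headroom internally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resources:
    """Hypothetical container for resource amounts (not an Ocean type)."""
    cpu_millicores: int
    memory_mib: int
    gpus: int


def automatic_headroom(requested: Resources, percentage: int) -> Resources:
    """Return `percentage` percent of the requested resources.

    Assumption: fractional results are rounded down. Ocean's real
    rounding behavior is not described in this document.
    """
    if not 0 <= percentage <= 100:
        raise ValueError("percentage must be between 0 and 100")
    return Resources(
        cpu_millicores=requested.cpu_millicores * percentage // 100,
        memory_mib=requested.memory_mib * percentage // 100,
        gpus=requested.gpus * percentage // 100,
    )


# Example: workloads request 40 vCPU, 80 GiB of memory, and 4 GPUs in total.
requested = Resources(cpu_millicores=40_000, memory_mib=81_920, gpus=4)
headroom = automatic_headroom(requested, percentage=10)
print(headroom)
```

With these inputs, a 10% setting reserves 4 vCPU and 8 GiB of spare capacity. Under the round-down assumption, it reserves no GPU, because 10% of 4 GPUs is less than one whole device.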
Manual headroom refers to headroom requirements that you specify. It is configured at the Ocean cluster level, the VNG level, or both, and is specified as headroom units that consist of the following components:
Ocean supports configuring both automatic and manual headroom on the same cluster (one on top of the other).
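To make "one on top of the other" concrete, the sketch below assumes that the two kinds of headroom are simply added together. The `Resources` and `HeadroomUnit` types, their field names, and the idea that a unit bundles per-unit CPU, memory, and GPU amounts with a unit count are hypothetical choices for this example, not Ocean's actual schema.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Resources:
    """Hypothetical container for resource amounts (not an Ocean type)."""
    cpu_millicores: int = 0
    memory_mib: int = 0
    gpus: int = 0

    def __add__(self, other: "Resources") -> "Resources":
        return Resources(
            self.cpu_millicores + other.cpu_millicores,
            self.memory_mib + other.memory_mib,
            self.gpus + other.gpus,
        )

    def scaled(self, factor: int) -> "Resources":
        return Resources(
            self.cpu_millicores * factor,
            self.memory_mib * factor,
            self.gpus * factor,
        )


@dataclass(frozen=True)
class HeadroomUnit:
    """Hypothetical manual headroom unit: a per-unit size and a unit count."""
    per_unit: Resources
    count: int


def manual_headroom(units: Iterable[HeadroomUnit]) -> Resources:
    """Sum the resources reserved by all manual headroom units."""
    total = Resources()
    for unit in units:
        total = total + unit.per_unit.scaled(unit.count)
    return total


def combined_headroom(automatic: Resources, units: Iterable[HeadroomUnit]) -> Resources:
    """Assumption: manual headroom is added on top of automatic headroom."""
    return automatic + manual_headroom(units)


automatic = Resources(cpu_millicores=4_000, memory_mib=8_192)
units = [
    HeadroomUnit(Resources(cpu_millicores=2_000, memory_mib=4_096), count=3),
    HeadroomUnit(Resources(cpu_millicores=1_000, memory_mib=2_048, gpus=1), count=2),
]
print(combined_headroom(automatic, units))
```

In this example the automatic buffer of 4 vCPU and 8 GiB grows to 12 vCPU, 24 GiB, and 2 GPUs once the two manual unit definitions are added.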