Skip to main content
Managed Kubernetes in GPU Cloud is a fully managed Kubernetes service for GPU-intensive and mixed-workload clusters. The control plane — the API server, scheduler, controller manager, and etcd — is operated by Gcore, while node pools and workloads remain under the operator’s control. Gcore is a CNCF certified Kubernetes provider, so existing tooling, manifests, and operators work without modification.

Cluster architecture

Each cluster consists of a Gcore-managed control plane and one or more node pools. The control plane manages scheduling, cluster state, and API access. Node pools are groups of worker nodes with identical configuration — instance type, volume settings, and network interface — where workloads run.
Managed Kubernetes architecture diagram showing the Gcore-managed control plane above three node pool types — virtual instances, bare metal, and GPU bare metal — with integrations to Object Storage, VAST Storage, Managed Logging, and Load Balancer
Pools within the same cluster can use different instance types, allowing GPU and CPU workloads to run side by side. The cluster autoscaler adjusts pool size based on pending pod demand and node utilization, within the min/max limits configured per pool.

Node types

A cluster can include any combination of the following node types:
TypeHardwareUse case
Virtual instancesStandard VMsDevelopment, testing, general workloads
Bare metal instancesDedicated physical serversHigh-performance CPU workloads
GPU bare metal instancesNVIDIA A100, H100, H200, L40s with built-in NVMe storageModel training, inference at scale
Available GPU models and bare metal configurations depend on the selected region.

Integrations

Clusters connect to Object Storage for artifact and dataset storage and to VAST Storage for high-performance NFS file shares optimized for GPU workloads. Managed Logging collects and stores cluster logs in OpenSearch Dashboards; Load Balancer integrates with the Kubernetes cloud controller manager for native traffic distribution.