Google Kubernetes Engine documentation
Deploy, manage, and scale containerized applications on Kubernetes, powered by Google Cloud. Learn more
Start your proof of concept with $300 in free credit
- Get access to Gemini 2.0 Flash Thinking
- Free monthly usage of popular products, including AI APIs and BigQuery
- No automatic charges, no commitment
Keep exploring with 20+ always-free products
Access 20+ free products for common use cases, including AI APIs, VMs, data warehouses, and more.
Documentation resources
AI/ML on GKE tutorials
-
AI/ML orchestration on GKE
-
Core concept: About GPUs in GKE
-
Core skill: Use GPUs in GKE
-
Serve Llama open models using GPUs with vLLM
-
Serve Gemma open models using GPUs with vLLM
-
Serve Gemma open models with Hugging Face TGI
-
Serve an LLM with multiple GPUs in GKE
-
Deploy GPUs for batch workloads with Dynamic Workload Scheduler
-
About Ray on GKE
Related resources
Related videos
Try GKE for yourself
Create an account to evaluate how our products perform in real-world
scenarios.
New customers also get $300 in free credits to run, test,
and deploy workloads.