You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/deployment/k8s.md
+102-3Lines changed: 102 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -4,6 +4,9 @@
4
4
5
5
Deploying vLLM on Kubernetes is a scalable and efficient way to serve machine learning models. This guide walks you through deploying vLLM using native Kubernetes.
6
6
7
+
*[Deployment with CPUs](#deployment-with-cpus)
8
+
*[Deployment with GPUs](#deployment-with-gpus)
9
+
7
10
Alternatively, you can deploy vLLM to Kubernetes using any of the following:
8
11
*[Helm](frameworks/helm.md)
9
12
*[InftyAI/llmaz](integrations/llmaz.md)
@@ -14,11 +17,107 @@ Alternatively, you can deploy vLLM to Kubernetes using any of the following:
0 commit comments