-**NOTE**: It may take some time for the containers to download the Docker images and LLM weights.
+!!! note
+    It may take some time for the containers to download the Docker images and LLM weights.

 ### Send a Query to the Stack

@@ -149,6 +150,8 @@ In this YAML configuration:
 * **`requestGPU`**: Specifies the number of GPUs required.
 * **`pvcStorage`**: Allocates persistent storage for the model.

-**NOTE:** If you intend to set up two pods, please refer to this [YAML file](https://github.com/vllm-project/production-stack/blob/main/tutorials/assets/values-01-2pods-minimal-example.yaml).
+!!! note
+    If you intend to set up two pods, please refer to this [YAML file](https://github.com/vllm-project/production-stack/blob/main/tutorials/assets/values-01-2pods-minimal-example.yaml).

-**NOTE:** vLLM production stack offers many more features (*e.g.* CPU offloading and a wide range of routing algorithms). Please check out these [examples and tutorials](https://github.com/vllm-project/production-stack/tree/main/tutorials) and our [repo](https://github.com/vllm-project/production-stack) for more details!
+!!! tip
+    vLLM production stack offers many more features (*e.g.* CPU offloading and a wide range of routing algorithms). Please check out these [examples and tutorials](https://github.com/vllm-project/production-stack/tree/main/tutorials) and our [repo](https://github.com/vllm-project/production-stack) for more details!