Skip to content

Commit 58e1f34

Browse files
authored
[Serve.llm] remove old vllm+serve doc (#51311)
<!-- Thank you for your contribution! Please review https://github.com/ray-project/ray/blob/master/CONTRIBUTING.rst before opening a pull request. --> <!-- Please add a reviewer to the assignee section when you create a PR. If you don't have the access to it, we will shortly find a reviewer and assign them to your PR. --> ## Why are these changes needed? Remove old vllm+serve doc so people are not confused with the new serve.llm apis. ## Related issue number <!-- For example: "Closes #1234" --> ## Checks - [ ] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( Signed-off-by: Gene Su <e870252314@gmail.com>
1 parent dc97201 commit 58e1f34

File tree

5 files changed

+1
-267
lines changed

5 files changed

+1
-267
lines changed

doc/BUILD

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -222,7 +222,6 @@ py_test_run_all_subdirectory(
222222
"source/serve/doc_code/intel_gaudi_inference_client.py",
223223
"source/serve/doc_code/distilbert.py",
224224
"source/serve/doc_code/stable_diffusion.py",
225-
"source/serve/doc_code/vllm_openai_example.py",
226225
"source/serve/doc_code/object_detection.py",
227226
"source/serve/doc_code/http_guide/streaming_example.py",
228227
"source/serve/doc_code/http_guide/websockets_example.py",

doc/source/cluster/kubernetes/examples/vllm-rayservice.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
# Serve a Large Language Model with vLLM on Kubernetes
44

5-
This guide demonstrates how to [Serve a Large Language Model with vLLM](https://docs.ray.io/en/latest/serve/tutorials/vllm-example.html) on Kubernetes using KubeRay. The example in this guide deploys the `meta-llama/Meta-Llama-3-8B-Instruct` model from Hugging Face on Google Kubernetes Engine (GKE).
5+
This guide demonstrates how to Serve a Large Language Model with vLLM on Kubernetes using KubeRay. The example in this guide deploys the `meta-llama/Meta-Llama-3-8B-Instruct` model from Hugging Face on Google Kubernetes Engine (GKE).
66

77
## Prerequisites
88

doc/source/serve/doc_code/vllm_openai_example.py

Lines changed: 0 additions & 205 deletions
This file was deleted.

doc/source/serve/examples.yml

Lines changed: 0 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -16,14 +16,6 @@ examples:
1616
- generative ai
1717
link: tutorials/stable-diffusion
1818
related_technology: ml applications
19-
- title: Serve a Large Language Model (LLM)
20-
skill_level: beginner
21-
use_cases:
22-
- generative ai
23-
- large language models
24-
- natural language processing
25-
link: tutorials/vllm-example
26-
related_technology: ml applications
2719
- title: Serve a Text Classification Model
2820
skill_level: beginner
2921
use_cases:

doc/source/serve/tutorials/vllm-example.md

Lines changed: 0 additions & 52 deletions
This file was deleted.

0 commit comments

Comments
 (0)