
Commit c69ceac

[Doc][0.7.3] Add release note for 0.7.3.post1 (#1010)

Cherry-pick the 0.7.3.post1 release note from main.

Signed-off-by: wangxiyuan <wangxiyuan1007@gmail.com>

1 parent 0bfa79c, commit c69ceac

File tree

4 files changed: +27 additions, −6 deletions


docs/source/conf.py

Lines changed: 2 additions & 2 deletions
```diff
@@ -67,10 +67,10 @@
 # the branch of vllm-ascend, used in vllm-ascend clone and image tag
 # - main branch: 'main'
 # - vX.Y.Z branch: latest vllm-ascend release tag
-'vllm_ascend_version': 'v0.7.3',
+'vllm_ascend_version': 'v0.7.3.post1',
 # the newest release version of vllm-ascend and matched vLLM, used in pip install.
 # This value should be updated when cut down release.
-'pip_vllm_ascend_version': "0.7.3",
+'pip_vllm_ascend_version': "0.7.3.post1",
 'pip_vllm_version': "0.7.3",
 # The maching MindIE Turbo for vLLM Ascend
 'pip_mindie_turbo_version': "2.0rc1",
```
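The two bumped keys above are what the docs use to render the pip install instructions. A minimal sketch of how the pinned pair could compose that command — the `ctx` dict and template string here are illustrative assumptions, not the actual Sphinx substitution code in conf.py:

```python
# Illustrative sketch: composing the documented install command from the
# version fields bumped in conf.py. The template string is an assumption,
# not the real docs templating.
ctx = {
    "pip_vllm_ascend_version": "0.7.3.post1",
    "pip_vllm_version": "0.7.3",
}

install_cmd = (
    "pip install vllm=={pip_vllm_version} "
    "vllm-ascend=={pip_vllm_ascend_version}".format(**ctx)
)
print(install_cmd)
# pip install vllm==0.7.3 vllm-ascend==0.7.3.post1
```

Note that only `pip_vllm_ascend_version` changes in this commit; the matching vLLM pin stays at 0.7.3.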

docs/source/developer_guide/versioning_policy.md

Lines changed: 2 additions & 0 deletions
```diff
@@ -67,6 +67,7 @@ Following is the Release Compatibility Matrix for vLLM Ascend Plugin:
 
 | vllm-ascend | vLLM | Python | Stable CANN | PyTorch/torch_npu | MindIE Turbo |
 |--------------|--------------| --- | --- | --- | --- |
+| v0.7.3.post1 | v0.7.3 | 3.9 - 3.11 | 8.1.0 | 2.5.1 / 2.5.1 | 2.0rc1 |
 | v0.7.3 | v0.7.3 | 3.9 - 3.11 | 8.1.0 | 2.5.1 / 2.5.1 | 2.0rc1 |
 | v0.7.3rc2 | v0.7.3 | 3.9 - 3.11 | 8.0.0 | 2.5.1 / 2.5.1.dev20250320 | / |
 | v0.7.3rc1 | v0.7.3 | 3.9 - 3.11 | 8.0.0 | 2.5.1 / 2.5.1.dev20250308 | / |
@@ -78,6 +79,7 @@ Following is the Release Compatibility Matrix for vLLM Ascend Plugin:
 
 | Date | Event |
 |------------|-------------------------------------------|
+| 2025.05.29 | Final release, v0.7.3.post1 |
 | 2025.05.08 | Final release, v0.7.3 |
 | 2025.04.17 | Release candidates, v0.8.4rc1 |
 | 2025.03.28 | Release candidates, v0.7.3rc2 |
```
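The matrix above pins each vllm-ascend release to the vLLM version it was tested against. A small sketch of checking a version pair against that matrix — `is_tested_pair` is a hypothetical helper written for this note, not part of vllm-ascend, and only the two final 0.7.3-line releases are encoded:

```python
# Hypothetical helper: check a (vllm-ascend, vLLM) pair against the
# compatibility matrix above. Only the two final releases are encoded here.
COMPAT_MATRIX = {
    "v0.7.3.post1": "v0.7.3",
    "v0.7.3": "v0.7.3",
}

def is_tested_pair(ascend: str, vllm: str) -> bool:
    """True if this vllm-ascend release was tested against this vLLM release."""
    return COMPAT_MATRIX.get(ascend) == vllm

print(is_tested_pair("v0.7.3.post1", "v0.7.3"))  # True
print(is_tested_pair("v0.7.3.post1", "v0.8.0"))  # False
```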

docs/source/faqs.md

Lines changed: 1 addition & 4 deletions
```diff
@@ -2,10 +2,7 @@
 
 ## Version Specific FAQs
 
-- [[v0.7.1rc1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/19)
-- [[v0.7.3rc1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/267)
-- [[v0.7.3rc2] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/418)
-- [[v0.8.4rc1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/546)
+- [[v0.7.3.post1] FAQ & Feedback](https://github.com/vllm-project/vllm-ascend/issues/1007)
 
 ## General FAQs
```

docs/source/user_guide/release_notes.md

Lines changed: 22 additions & 0 deletions
```diff
@@ -1,5 +1,27 @@
 # Release note
 
+## v0.7.3.post1
+
+This is the first post-release of 0.7.3. Please follow the [official doc](https://vllm-ascend.readthedocs.io/en/v0.7.3-dev) to start the journey. It includes the following changes:
+
+### Highlights
+
+- Qwen3 and Qwen3MOE are supported now. The performance and accuracy of Qwen3 are well tested. You can try it now. MindIE Turbo is recommended to improve the performance of Qwen3. [#903](https://github.com/vllm-project/vllm-ascend/pull/903) [#915](https://github.com/vllm-project/vllm-ascend/pull/915)
+- Added a new performance guide. The guide aims to help users improve vllm-ascend performance at the system level. It covers OS configuration, library optimization, a deployment guide, and more. [#878](https://github.com/vllm-project/vllm-ascend/pull/878) [Doc Link](https://vllm-ascend.readthedocs.io/en/v0.7.3-dev/developer_guide/performance/optimization_and_tuning.html)
+
+### Bug Fix
+
+- Qwen2.5-VL works for RLHF scenarios now. [#928](https://github.com/vllm-project/vllm-ascend/pull/928)
+- Users can now launch a model from online weights, e.g. directly from Hugging Face or ModelScope. [#858](https://github.com/vllm-project/vllm-ascend/pull/858) [#918](https://github.com/vllm-project/vllm-ascend/pull/918)
+- The meaningless log info `UserWorkspaceSize0` has been cleaned up. [#911](https://github.com/vllm-project/vllm-ascend/pull/911)
+- The log level for `Failed to import vllm_ascend_C` has been changed from `error` to `warning`. [#956](https://github.com/vllm-project/vllm-ascend/pull/956)
+- DeepSeek MLA now works with chunked prefill in the V1 engine. Please note that the V1 engine in 0.7.3 is experimental and for test usage only. [#849](https://github.com/vllm-project/vllm-ascend/pull/849) [#936](https://github.com/vllm-project/vllm-ascend/pull/936)
+
+### Docs
+
+- The benchmark doc is updated for Qwen2.5 and Qwen2.5-VL. [#792](https://github.com/vllm-project/vllm-ascend/pull/792)
+- Added a note to clarify that only "modelscope<1.23.0" works with 0.7.3. [#954](https://github.com/vllm-project/vllm-ascend/pull/954)
+
 ## v0.7.3
 
 🎉 Hello, World!
```
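The docs change above pins `modelscope<1.23.0` for the 0.7.3 line. A dependency-free sketch of that constraint as a version-tuple comparison — `parse_version` and `modelscope_ok` are hypothetical helpers for illustration, and this naive parser handles only plain dotted releases (no rc/post segments):

```python
# Hypothetical helpers mirroring the "modelscope<1.23.0" note: compare dotted
# release versions as integer tuples (no pre-/post-release handling).
def parse_version(v: str) -> tuple:
    return tuple(int(part) for part in v.split("."))

def modelscope_ok(version: str) -> bool:
    """True if this modelscope release satisfies modelscope<1.23.0."""
    return parse_version(version) < parse_version("1.23.0")

print(modelscope_ok("1.22.3"))  # True
print(modelscope_ok("1.23.0"))  # False
```

Integer-tuple comparison avoids the string-ordering pitfall where "1.9" would sort after "1.23"; a real check should use `packaging.version` instead.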
