Skip to content

Commit ef72873

Browse files
committed
Update kunlunxin_xpu.md
1 parent 90a5b18 commit ef72873

File tree

2 files changed

+11
-11
lines changed

2 files changed

+11
-11
lines changed

docs/get_started/installation/kunlunxin_xpu.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ python -m pip install --pre paddlepaddle-xpu -i https://www.paddlepaddle.org.cn/
4343
### Install FastDeploy (**Do NOT install via PyPI source**)
4444

4545
```bash
46-
python -m pip install fastdeploy-xpu==2.0.0 -i https://www.paddlepaddle.org.cn/packages/stable/xpu-p800/
46+
python -m pip install fastdeploy-xpu==2.0.0 -i https://www.paddlepaddle.org.cn/packages/stable/xpu-p800/
4747
```
4848

4949
Alternatively, you can install the latest version of FastDeploy (Not recommended)
@@ -52,7 +52,7 @@ Alternatively, you can install the latest version of FastDeploy (Not recommended
5252
python -m pip install --pre fastdeploy-xpu -i https://www.paddlepaddle.org.cn/packages/nightly/xpu-p800/
5353
```
5454

55-
### 3. Build wheel from source
55+
## 3. Build wheel from source
5656

5757
### Install PaddlePaddle
5858

@@ -115,9 +115,9 @@ Currently, P800 has only validated deployment of the following models:
115115
- ERNIE-4.5-300B-A47B-Paddle 128K WINT4 (8-card)
116116

117117
### Offline inference
118-
118+
119119
After installing FastDeploy, you can perform offline text generation with user-provided prompts using the following code,
120-
120+
121121
```python
122122
from fastdeploy import LLM, SamplingParams
123123

@@ -141,11 +141,11 @@ for output in outputs:
141141

142142
Refer to [Parameters](../../parameters.md) for more configuration options.
143143

144-
## Online serving (OpenAI API-Compatible server)
144+
### Online serving (OpenAI API-Compatible server)
145145

146146
Deploy an OpenAI API-compatible server using FastDeploy with the following commands:
147147

148-
### Start service
148+
#### Start service
149149

150150
**ERNIE-4.5-300B-A47B-Paddle 32K WINT4 (8-card) (Recommended)**
151151

@@ -175,7 +175,7 @@ python -m fastdeploy.entrypoints.openai.api_server \
175175

176176
Refer to [Parameters](../../parameters.md) for more options.
177177

178-
### Send requests
178+
#### Send requests
179179

180180
Send requests using either curl or Python
181181

@@ -218,4 +218,4 @@ for chunk in response:
218218
print('\n')
219219
```
220220

221-
For detailed OpenAI protocol specifications, see [OpenAI Chat Compeltion API](https://platform.openai.com/docs/api-reference/chat/create). Differences from the standard OpenAI protocol are documented in [Deployment](../../serving/README.md).
221+
For detailed OpenAI protocol specifications, see [OpenAI Chat Compeltion API](https://platform.openai.com/docs/api-reference/chat/create).

docs/zh/get_started/installation/kunlunxin_xpu.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ python -m pip install --pre paddlepaddle-xpu -i https://www.paddlepaddle.org.cn/
4343
### 安装 FastDeploy(**注意不要通过 pypi 源安装**
4444

4545
```bash
46-
python -m pip install fastdeploy-xpu==2.0.0 -i https://www.paddlepaddle.org.cn/packages/stable/xpu-p800/
46+
python -m pip install fastdeploy-xpu==2.0.0 -i https://www.paddlepaddle.org.cn/packages/stable/xpu-p800/
4747
```
4848

4949
或者你也可以安装最新版 FastDeploy(不推荐)
@@ -95,6 +95,7 @@ git checkout <tag or branch>
9595
cd FastDeploy
9696
bash build.sh
9797
```
98+
9899
编译后的产物在 ```FastDeploy/dist``` 目录下。
99100

100101
## 验证是否安装成功
@@ -222,5 +223,4 @@ for chunk in response:
222223
print('\n')
223224
```
224225

225-
OpenAI 协议的更多说明可参考文档 [OpenAI Chat Compeltion API](https://platform.openai.com/docs/api-reference/chat/create),以及与 OpenAI 协议的区别可以参考 [服务化部署](../../serving/README.md)
226-
226+
OpenAI 协议的更多说明可参考文档 [OpenAI Chat Compeltion API](https://platform.openai.com/docs/api-reference/chat/create)

0 commit comments

Comments
 (0)