Skip to content

Commit 47299db

Browse files
Update supported models
1 parent 6cb1a75 commit 47299db

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

docs/zh/index.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,13 +13,13 @@
1313

1414
| Model | Data Type | PD Disaggregation | Chunked Prefill | Prefix Caching | MTP | CUDA Graph | Maximum Context Length |
1515
|:--- | :------- | :---------- | :-------- | :-------- | :----- | :----- | :----- |
16-
|ERNIE-4.5-300B-A47B | BF16/WINT4/WINT8/W4A8C8/WINT2/FP8 |(WINT4/W4A8C8/Expert Parallelism)|||✅(WINT4)| WIP |128K |
17-
|ERNIE-4.5-300B-A47B-Base| BF16/WINT4/WINT8 |(WINT4/Expert Parallelism)|||✅(WINT4)| | 128K |
16+
|ERNIE-4.5-300B-A47B | BF16/WINT4/WINT8/W4A8C8/WINT2/FP8 ||||✅(WINT4)| WIP |128K |
17+
|ERNIE-4.5-300B-A47B-Base| BF16/WINT4/WINT8 ||||✅(WINT4)| WIP | 128K |
1818
|ERNIE-4.5-VL-424B-A47B | BF16/WINT4/WINT8 | WIP || WIP || WIP |128K |
1919
|ERNIE-4.5-VL-28B-A3B | BF16/WINT4/WINT8 ||| WIP || WIP |128K |
2020
|ERNIE-4.5-21B-A3B | BF16/WINT4/WINT8/FP8 |||| WIP ||128K |
2121
|ERNIE-4.5-21B-A3B-Base | BF16/WINT4/WINT8/FP8 |||| WIP ||128K |
22-
|ERNIE-4.5-0.3B | BF16/WINT8/FP8 ||||||128K |
22+
|ERNIE-4.5-0.3B | BF16/WINT8/FP8 |||||| 128K |
2323

2424
## 文档说明
2525

0 commit comments

Comments
 (0)