File tree Expand file tree Collapse file tree 2 files changed +2
-2
lines changed Expand file tree Collapse file tree 2 files changed +2
-2
lines changed Original file line number Diff line number Diff line change @@ -13,7 +13,7 @@ export FD_MODEL_CACHE=/ssd1/download_models
13
13
14
14
| Model Name | Context Length | Quantization | Minimum Deployment Resources | Notes |
15
15
| :--------- | :------------- | :----------- | :-------------------------- | :---- |
16
- | baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT2 | 1* 96G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
16
+ | baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT2 | 1* 141G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
17
17
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT4 | 4* 80G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
18
18
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT8 | 8* 80G GPU VRAM/1T RAM | Chunked Prefill required for 128K |
19
19
| baidu/ERNIE-4.5-300B-A47B-Paddle | 32K/128K | WINT4 | 4* 64G GPU VRAM/600G RAM | Chunked Prefill required for 128K |
Original file line number Diff line number Diff line change @@ -14,7 +14,7 @@ export FD_MODEL_CACHE=/ssd1/download_models
14
14
15
15
| 模型名 | 上下文长度 | 量化方式 | 最小部署资源 | 说明 |
16
16
| :----- | :-------------- | :----------- | :----------- | :----------- |
17
- | baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT2 | 1卡* 96G显存 /1T内存 | 128K需要开启Chunked Prefill |
17
+ | baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT2 | 1卡* 141G显存 /1T内存 | 128K需要开启Chunked Prefill |
18
18
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT4 | 4卡* 80G显存/1T内存 | 128K需要开启Chunked Prefill |
19
19
| baidu/ERNIE-4.5-VL-424B-A47B-Paddle | 32K/128K | WINT8 | 8卡* 80G显存/1T内存 | 128K需要开启Chunked Prefill |
20
20
| baidu/ERNIE-4.5-300B-A47B-Paddle | 32K/128K | WINT4 | 4卡* 64G显存/600G内存 | 128K需要开启Chunked Prefill |
You can’t perform that action at this time.
0 commit comments