|
29 | 29 | [Please visit the Deep Learning Framework (DLFW) website for the complete compatibility matrix](https://docs.nvidia.com/deeplearning/frameworks/support-matrix/index.html).
|
30 | 30 |
|
31 | 31 | # Release Compatibility Matrix
|
32 |
| - 1. [Container Name: trtllm-python-py3](#container-name-trtllm-python-py3) |
33 |
| - 2. [Container Name: vllm-python-py3](#container-name-vllm-python-py3) |
34 |
| - 3. [ONNX Runtime Versions](#onnx-runtime-versions) |
| 32 | +- [Release Compatibility Matrix](#release-compatibility-matrix) |
| 33 | + - [Container Name: trtllm-python-py3](#container-name-trtllm-python-py3) |
| 34 | + - [Container Name: vllm-python-py3](#container-name-vllm-python-py3) |
| 35 | + - [ONNX Runtime Versions](#onnx-runtime-versions) |
35 | 36 |
|
36 | 37 | ## Container Name: trtllm-python-py3
|
37 | 38 |
|
38 | 39 | | Triton release version | NGC Tag | Python version | Torch version | TensorRT version | TensorRT-LLM version | CUDA version | CUDA Driver version | Size |
|
39 | 40 | | --- | --- | --- | --- | --- | --- | --- | --- | --- |
|
| 41 | +| 25.04 | nvcr.io/nvidia/tritonserver:25.04-trtllm-python-py3 | Python 3.12.3 | 2.7.0a0+7c8ec84dab.nv25.3 | 10.9.0.34 | 0.18.2 | 12.8.1.012 | 570.124.06 | 17G | |
| 42 | +| 25.03 | nvcr.io/nvidia/tritonserver:25.03-trtllm-python-py3 | Python 3.12.3 | 2.7.0a0+7c8ec84dab.nv25.3 | 10.9.0.34 | 0.18.0 | 12.8.1.012 | 570.124.06 | 28G | |
40 | 43 | | 25.02 | nvcr.io/nvidia/tritonserver:25.02-trtllm-python-py3 | Python 3.12.3 | 2.6.0a0+ecf3bae40a.nv25.1 | 10.8.0.43 | 0.17.0.post1 | 12.8.0.038 | 570.86.10 | 28G |
|
41 | 44 | | 25.01 | nvcr.io/nvidia/tritonserver:25.01-trtllm-python-py3 | Python 3.12.3 | 2.6.0a0+ecf3bae40a.nv25.1 | 10.8.0.43 | 0.17.0 | 12.8.0.038 | 570.86.10 | 30G |
|
42 | 45 | | 24.12 | nvcr.io/nvidia/tritonserver:24.12-trtllm-python-py3 | Python 3.12.3 | 2.6.0a0+df5bbc09d1.nv24.11 | 10.7.0 | 0.16.0 | 12.6.3 | 560.35.05 | 22G |
|
|
53 | 56 |
|
54 | 57 | | Triton release version | NGC Tag | Python version | vLLM version | CUDA version | CUDA Driver version | Size |
|
55 | 58 | | --- | --- | --- | --- | --- | --- | --- |
|
| 59 | +| 25.04 | nvcr.io/nvidia/tritonserver:25.04-vllm-python-py3 | Python 3.12.3 | 0.8.1+5f4af9e0.nv25.4.cu129 | 12.9.0.036 | 575.51.02 | 10G | |
| 60 | +| 25.03 | nvcr.io/nvidia/tritonserver:25.03-vllm-python-py3 | Python 3.12.3 | 0.7.3+04de634a.nv25.3.cu128 | 12.8.1.012 | 570.124.06 | 22G | |
56 | 61 | | 25.02 | nvcr.io/nvidia/tritonserver:25.02-vllm-python-py3 | Python 3.12.3 | 0.7.0+5e800e3d.nv25.2.cu128 | 12.8.0.038 | 570.86.10 | 22G |
|
57 | 62 | | 25.01 | nvcr.io/nvidia/tritonserver:25.01-vllm-python-py3 | Python 3.12.3 | 0.6.3.post1 | 12.8.0.038 | 570.86.10 | 23G |
|
58 | 63 | | 24.12 | nvcr.io/nvidia/tritonserver:24.12-vllm-python-py3 | Python 3.12.3 | 0.5.5 | 12.6.3.004 | 560.35.05 | 20G |
|
|
69 | 74 |
|
70 | 75 | | Triton release version | ONNX Runtime |
|
71 | 76 | | --- | --- |
|
| 77 | +| 25.04 | 1.21.0 | |
| 78 | +| 25.03 | 1.21.0 | |
72 | 79 | | 25.02 | 1.20.1 |
|
73 | 80 | | 25.01 | 1.20.1 |
|
74 | 81 | | 24.12 | 1.20.1 |
|
|