Skip to content

Commit d79c4f1

Browse files
R25.04 compatibility (#8201)
Co-authored-by: Kyle McGill <101670481+nv-kmcgill53@users.noreply.github.com>
1 parent 4226ab8 commit d79c4f1

File tree

1 file changed

+10
-3
lines changed

1 file changed

+10
-3
lines changed

docs/introduction/compatibility.md

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -29,14 +29,17 @@
2929
[Please visit Deep Learning Framework (DLFW) website for the complete compatibility matrix](https://docs.nvidia.com/deeplearning/frameworks/support-matrix/index.html).
3030

3131
# Release Compatibility Matrix
32-
1. [Container Name: trtllm-python-py3](#container-name-trtllm-python-py3)
33-
2. [Container Name: vllm-python-py3](#container-name-vllm-python-py3)
34-
3. [ONNX Runtime Versions](#onnx-runtime-versions)
32+
- [Release Compatibility Matrix](#release-compatibility-matrix)
33+
- [Container Name: trtllm-python-py3](#container-name-trtllm-python-py3)
34+
- [Container Name: vllm-python-py3](#container-name-vllm-python-py3)
35+
- [ONNX Runtime Versions](#onnx-runtime-versions)
3536

3637
## Container Name: trtllm-python-py3
3738

3839
| Triton release version | NGC Tag | Python version | Torch version | TensorRT version | TensorRT-LLM version | CUDA version | CUDA Driver version | Size |
3940
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
41+
| 25.04 | nvcr.io/nvidia/tritonserver:25.04-trtllm-python-py3 | Python 3.12.3 | 2.7.0a0+7c8ec84dab.nv25.3 | 10.9.0.34 | 0.18.2 | 12.8.1.012 | 570.124.06 | 17G |
42+
| 25.03 | nvcr.io/nvidia/tritonserver:25.03-trtllm-python-py3 | Python 3.12.3 | 2.7.0a0%2B7c8ec84dab.nv25.3 | 10.9.0.34 | 0.18.0 | 12.8.1.012 | 570.124.06 | 28G |
4043
| 25.02 | nvcr.io/nvidia/tritonserver:25.02-trtllm-python-py3 | Python 3.12.3 | 2.6.0a0%2Becf3bae40a.nv25.1 | 10.8.0.43 | 0.17.0.post1 | 12.8.0.038 | 570.86.10 | 28G |
4144
| 25.01 | nvcr.io/nvidia/tritonserver:25.01-trtllm-python-py3 | Python 3.12.3 | 2.6.0a0%2Becf3bae40a.nv25.1 | 10.8.0.43 | 0.17.0 | 12.8.0.038 | 570.86.10 | 30G |
4245
| 24.12 | nvcr.io/nvidia/tritonserver:24.12-trtllm-python-py3 | Python 3.12.3 | 2.6.0a0%2Bdf5bbc09d1.nv24.11 | 10.7.0 | 0.16.0 | 12.6.3 | 560.35.05 | 22G |
@@ -53,6 +56,8 @@
5356

5457
| Triton release version | NGC Tag | Python version | vLLM version | CUDA version | CUDA Driver version | Size |
5558
| --- | --- | --- | --- | --- | --- | --- |
59+
| 25.04 | nvcr.io/nvidia/tritonserver:25.04-vllm-python-py3 | Python 3.12.3 | 0.8.1+5f4af9e0.nv25.4.cu129 | 12.9.0.036 | 575.51.02 | 10G |
60+
| 25.03 | nvcr.io/nvidia/tritonserver:25.03-vllm-python-py3 | Python 3.12.3 | 0.7.3+04de634a.nv25.3.cu128 | 12.8.1.012 | 570.124.06 | 22G |
5661
| 25.02 | nvcr.io/nvidia/tritonserver:25.02-vllm-python-py3 | Python 3.12.3 | 0.7.0+5e800e3d.nv25.2.cu128 | 12.8.0.038 | 570.86.10 | 22G |
5762
| 25.01 | nvcr.io/nvidia/tritonserver:25.01-vllm-python-py3 | Python 3.12.3 | 0.6.3.post1 | 12.8.0.038 | 570.86.10 | 23G |
5863
| 24.12 | nvcr.io/nvidia/tritonserver:24.12-vllm-python-py3 | Python 3.12.3 | 0.5.5 | 12.6.3.004 | 560.35.05 | 20G |
@@ -69,6 +74,8 @@
6974

7075
| Triton release version | ONNX Runtime |
7176
| --- | --- |
77+
| 25.04 | 1.21.0 |
78+
| 25.03 | 1.21.0 |
7279
| 25.02 | 1.20.1 |
7380
| 25.01 | 1.20.1 |
7481
| 24.12 | 1.20.1 |

0 commit comments

Comments
 (0)