Back to Triton Inference Server

Copyright (c) 2024-2026, NVIDIA CORPORATION. All rights reserved.

docs/introduction/compatibility.md

2.68.010.2 KB
Original Source
<!-- # Copyright (c) 2024-2026, NVIDIA CORPORATION. All rights reserved. # # Redistribution and use in source and binary forms, with or without # modification, are permitted provided that the following conditions # are met: # * Redistributions of source code must retain the above copyright # notice, this list of conditions and the following disclaimer. # * Redistributions in binary form must reproduce the above copyright # notice, this list of conditions and the following disclaimer in the # documentation and/or other materials provided with the distribution. # * Neither the name of NVIDIA CORPORATION nor the names of its # contributors may be used to endorse or promote products derived # from this software without specific prior written permission. # # THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS ``AS IS'' AND ANY # EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE # IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR # PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR # CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, # EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT LIMITED TO, # PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, DATA, OR # PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY THEORY # OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT # (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE # OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. -->

Please visit Deep Learning Framework (DLFW) website for the complete compatibility matrix.

Release Compatibility Matrix

Container Name: trtllm-python-py3

Triton release versionNGC TagPython versionTorch versionTensorRT versionTensorRT-LLM versionCUDA versionCUDA Driver versionSize
26.04nvcr.io/nvidia/tritonserver:26.04-trtllm-python-py3Python 3.12.32.10.0a0+b4e4ee81d3.nv25.1210.14.1.481.2.113.1.0.036590.44.0114.22 GB
26.03nvcr.io/nvidia/tritonserver:26.03-trtllm-python-py3Python 3.12.32.10.0a0+b4e4ee81d3.nv25.1210.14.1.481.2.013.1.0.036590.44.0114.18 GB
26.02nvcr.io/nvidia/tritonserver:26.02-trtllm-python-py3Python 3.12.32.9.0a0+145a3a7bda.nv25.1010.13.3.91.1.013.0.2.006580.95.0516.17 GB
26.01nvcr.io/nvidia/tritonserver:26.01-trtllm-python-py3Python 3.12.32.9.0a0+145a3a7bda.nv25.1010.13.3.91.1.013.0.2.006580.95.0516.17 GB
25.12nvcr.io/nvidia/tritonserver:25.12-trtllm-python-py3Python 3.12.32.9.0a0+145a3a7bda.nv25.1010.13.3.91.1.013.0.2.006580.95.0516.04 GB
25.11nvcr.io/nvidia/tritonserver:25.11-trtllm-python-py3Python 3.12.32.9.0a0+145a3a7bda.nv25.1010.13.3.91.0.3.251013.0.2.006580.95.0512.25 GB
25.10nvcr.io/nvidia/tritonserver:25.10-trtllm-python-py3Python 3.12.32.8.0a0+5228986c39.nv25.610.11.0.331.0.012.9.1.010575.57.0816.21 GB
25.09nvcr.io/nvidia/tritonserver:25.09-trtllm-python-py3Python 3.12.32.8.0a0+5228986c39.nv25.610.11.0.331.0.012.9.1.010575.57.0816.25 GB
25.08nvcr.io/nvidia/tritonserver:25.08-trtllm-python-py3Python 3.12.32.8.0a0+5228986c39.nv25.510.11.0.330.21.012.9.0.043575.51.0320.49 GB
25.07nvcr.io/nvidia/tritonserver:25.07-trtllm-python-py3Python 3.12.32.7.0a0+79aa17489c.nv25.410.10.0.310.20.012.9.0.036575.51.0318.3G
25.06nvcr.io/nvidia/tritonserver:25.06-trtllm-python-py3Python 3.12.32.7.0a0+79aa17489c.nv25.410.10.0.310.20.012.9.0.036575.51.0318.3G
25.05nvcr.io/nvidia/tritonserver:25.05-trtllm-python-py3Python 3.12.32.7.0a0+7c8ec84dab.nv25.310.9.0.340.19.012.8.1.012570.124.0617G
25.04nvcr.io/nvidia/tritonserver:25.04-trtllm-python-py3Python 3.12.32.7.0a0+7c8ec84dab.nv25.310.9.0.340.18.212.8.1.012570.124.0617G
25.03nvcr.io/nvidia/tritonserver:25.03-trtllm-python-py3Python 3.12.32.7.0a0+7c8ec84dab.nv25.310.9.0.340.18.012.8.1.012570.124.0628G
25.02nvcr.io/nvidia/tritonserver:25.02-trtllm-python-py3Python 3.12.32.6.0a0+ecf3bae40a.nv25.110.8.0.430.17.0.post112.8.0.038570.86.1028G
25.01nvcr.io/nvidia/tritonserver:25.01-trtllm-python-py3Python 3.12.32.6.0a0+ecf3bae40a.nv25.110.8.0.430.17.012.8.0.038570.86.1030G
24.12nvcr.io/nvidia/tritonserver:24.12-trtllm-python-py3Python 3.12.32.6.0a0+df5bbc09d1.nv24.1110.7.00.16.012.6.3560.35.0522G
24.11nvcr.io/nvidia/tritonserver:24.11-trtllm-python-py3Python 3.10.122.5.0a0+e000cf0ad9.nv24.1010.6.00.15.012.6.3555.42.0624.8G
24.10nvcr.io/nvidia/tritonserver:24.10-trtllm-python-py3Python 3.10.122.4.0a0+3bcc3cddb5.nv24.710.4.00.14.012.5.1.007555.42.0623.3G
24.09nvcr.io/nvidia/tritonserver:24.09-trtllm-python-py3Python 3.10.122.4.0a0+3bcc3cddb5.nv24.710.4.00.13.012.5.1.007555.42.0621G
24.08nvcr.io/nvidia/tritonserver:24.08-trtllm-python-py3Python 3.10.122.4.0a0+3bcc3cddb5.nv24.710.3.00.12.012.5.1.007555.42.0621G
24.07nvcr.io/nvidia/tritonserver:24.07-trtllm-python-py3Python 3.10.122.4.0a0+07cecf4168.nv24.510.1.00.11.012.4.1.003550.54.1523G
24.06nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3Python 3.10.122.3.0a0+40ec155e58.nv24.310.0.10.10.012.4.0.041550.54.1431G
24.05nvcr.io/nvidia/tritonserver:24.05-trtllm-python-py3Python 3.10.122.3.0a0+ebedce210.0.1.60.9.012.3.2.001545.23.0834G
24.04nvcr.io/nvidia/tritonserver:24.04-trtllm-python-py3Python 3.10.122.3.0a0+ebedce29.3.0.post12.dev10.9.012.3.2.001545.23.0834G

Container Name: vllm-python-py3

Triton release versionNGC TagPython versionvLLM versionCUDA versionCUDA Driver versionSize
26.04nvcr.io/nvidia/tritonserver:26.04-vllm-python-py3Python 3.12.30.19.0+6bc3197f.nv26.04.4876126813.2.1.009595.58.039.09G
26.03nvcr.io/nvidia/tritonserver:26.03-vllm-python-py3Python 3.12.30.17.1+fb2e3ab6.nv26.3.46332470.cu13213.2.0.046595.45.049.22G
26.02nvcr.io/nvidia/tritonserver:26.02-vllm-python-py3Python 3.12.30.15.1+nv26.213.1.1.006590.48.018.9G
26.01nvcr.io/nvidia/tritonserver:26.01-vllm-python-py3Python 3.12.30.13.0+faa43dbf.nv26.1.cu13113.1.1.006590.48.018.79G
25.12nvcr.io/nvidia/tritonserver:25.12-vllm-python-py3Python 3.12.30.11.1+9114fd76.nv25.12.cu13113.1.0.036590.44.018.54G
25.11nvcr.io/nvidia/tritonserver:25.11-vllm-python-py3Python 3.12.30.11.0+582e4e37.nv25.11.cu13013.0.2.006580.95.058.72G
25.10nvcr.io/nvidia/tritonserver:25.10-vllm-python-py3Python 3.12.30.10.2+9dd9ca32.nv25.10.cu13013.0.2.006580.95.058.34G
25.09nvcr.io/nvidia/tritonserver:25.09-vllm-python-py3Python 3.12.30.10.1.1+381074ae.nv25.9.cu13013.0.1.012580.82.077.78G
25.08nvcr.io/nvidia/tritonserver:25.08-vllm-python-py3Python 3.12.30.9.2+4ef1e343.nv25.8.post1.cu13013.0.1.012580.82.078.1G
25.07nvcr.io/nvidia/tritonserver:25.07-vllm-python-py3Python 3.12.30.9.0rc1+1958ee56.nv25.6.cu12912.9.0.043575.51.0310G
25.06nvcr.io/nvidia/tritonserver:25.06-vllm-python-py3Python 3.12.30.9.0rc1+1958ee56.nv25.6.cu12912.9.0.043575.51.0310G
25.05nvcr.io/nvidia/tritonserver:25.05-vllm-python-py3Python 3.12.30.8.4+dc1a3e10.nv25.5.cu12912.9.0.043575.51.0310G
25.04nvcr.io/nvidia/tritonserver:25.04-vllm-python-py3Python 3.12.30.8.1+5f4af9e0.nv25.4.cu12912.9.0.036575.51.0210G
25.03nvcr.io/nvidia/tritonserver:25.03-vllm-python-py3Python 3.12.30.7.3+04de634a.nv25.3.cu12812.8.1.012570.124.0622G
25.02nvcr.io/nvidia/tritonserver:25.02-vllm-python-py3Python 3.12.30.7.0+5e800e3d.nv25.2.cu12812.8.0.038570.86.1022G
25.01nvcr.io/nvidia/tritonserver:25.01-vllm-python-py3Python 3.12.30.6.3.post112.8.0.038570.86.1023G
24.12nvcr.io/nvidia/tritonserver:24.12-vllm-python-py3Python 3.12.30.5.512.6.3.004560.35.0520G
24.11nvcr.io/nvidia/tritonserver:24.11-vllm-python-py3Python 3.12.30.5.512.6.3.001560.35.0522.1G
24.10nvcr.io/nvidia/tritonserver:24.10-vllm-python-py3Python 3.10.120.5.512.6.2.004560.35.0321G
24.09nvcr.io/nvidia/tritonserver:24.09-vllm-python-py3Python 3.10.120.5.3.post112.6.1.006560.35.0319G
24.08nvcr.io/nvidia/tritonserver:24.08-vllm-python-py3Python 3.10.120.5.0 post112.6.0.022560.35.0319G
24.07nvcr.io/nvidia/tritonserver:24.07-vllm-python-py3Python 3.10.120.5.0 post112.5.1555.42.0619G
24.06nvcr.io/nvidia/tritonserver:24.06-vllm-python-py3Python 3.10.120.4.312.5.0.23555.42.0218G
24.05nvcr.io/nvidia/tritonserver:24.05-vllm-python-py3Python 3.10.120.4.0 post112.4.1550.54.1518G
24.04nvcr.io/nvidia/tritonserver:24.04-vllm-python-py3Python 3.10.120.4.0 post112.4.1550.54.1517G

ONNX Runtime Versions

Triton release versionONNX Runtime
26.041.24.4
26.031.24.2
26.021.24.1
26.011.23.2
25.121.23.2
25.111.23.2
25.101.23.1
25.091.23.0
25.081.23.0+1d1712fdaf
25.071.22.0
25.061.22.0
25.051.22.0
25.041.21.0
25.031.21.0
25.021.20.1
25.011.20.1
24.121.20.1
24.111.19.2
24.101.19.2
24.091.19.2
24.081.18.1
24.071.18.1
24.061.18.0
24.051.18.0
24.041.17.3