TensorRT
TensorRT is an SDK for high-performance deep learning inference on NVIDIA GPUs. It provides tools and APIs to optimize and deploy AI models, accelerating AI inference workflows.
About
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open-source components of TensorRT: the sources for the TensorRT plugins and the ONNX parser, along with sample applications that demonstrate its usage and capabilities. It enables developers to optimize and deploy AI models efficiently through its APIs. The build supports several CUDA versions and offers containerized build options for different Linux distributions and architectures, including cross-compilation for Jetson and DriveOS. TensorRT also provides a prebuilt Python package that can be installed with pip, so users can quickly integrate it into Python-based AI projects.
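As a rough illustration of the pip-based installation mentioned above, the sketch below checks whether the prebuilt Python package is importable and reports its version. It assumes the package was installed with `pip install tensorrt` and is imported under the module name `tensorrt`; the helper name `tensorrt_version` is hypothetical and not part of TensorRT.

```python
import importlib


def tensorrt_version():
    """Return the installed TensorRT version string, or None if unavailable.

    Hypothetical helper for illustration; assumes the package was installed
    with `pip install tensorrt` and is importable as `tensorrt`.
    """
    try:
        trt = importlib.import_module("tensorrt")
    except (ImportError, OSError):
        # The package is not installed, or its native libraries failed to load.
        return None
    return getattr(trt, "__version__", None)


if __name__ == "__main__":
    version = tensorrt_version()
    if version is None:
        print("TensorRT Python package not found; try: pip install tensorrt")
    else:
        print(f"TensorRT Python package version: {version}")
```

The check returns `None` instead of raising, so it can run on machines without an NVIDIA GPU or without the package installed.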
Pricing & Plans
Open Source
Free