# TensorRT-YOLOn
**Repository Path**: ctpactwangke/tensor-rt-yolon
## Basic Information
- **Project Name**: TensorRT-YOLOn
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: GPL-3.0
- **Default Branch**: main
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2025-10-26
- **Last Updated**: 2025-10-26
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
English | [简体中文](README.md)
---
🚀 TensorRT-YOLO is an **easy-to-use**, **extremely efficient** inference deployment tool for the **YOLO series**, designed specifically for NVIDIA devices. The project not only integrates TensorRT plugins to enhance post-processing but also utilizes CUDA kernels and CUDA Graphs to accelerate inference. TensorRT-YOLO provides support for both C++ and Python inference, aiming to deliver an 📦 **out-of-the-box** deployment experience. It covers task scenarios such as [object detection](examples/detect/), [instance segmentation](examples/segment/), [image classification](examples/classify/), [pose estimation](examples/pose/), [oriented object detection](examples/obb/), and [video analysis](examples/VideoPipe), meeting developers' deployment needs across **multiple scenarios**.
Demo previews: [oriented object detection](examples/obb/) · [object detection](examples/detect/) · [instance segmentation](examples/segment/) · [pose estimation](examples/pose/) · [video analysis](examples/videopipe)
## 🔄 Recent updates
- 2025-04-19: Added support for [YOLO-World](https://docs.ultralytics.com/zh/models/yolo-world/) and [YOLOE](https://docs.ultralytics.com/zh/models/yoloe/), including classification, oriented bounding boxes, pose estimation, and instance segmentation. See [Bilibili](https://www.bilibili.com/video/BV12N5bzkENV) for details. 🌟 NEW
- 2025-03-29: Added support for [YOLO12](https://github.com/sunsmarterjie/yolov12), including classification, oriented bounding boxes, pose estimation, and instance segmentation. See [issues](https://github.com/sunsmarterjie/yolov12/issues/22) for details. 🌟 NEW
- [Performance Leap! TensorRT-YOLO 6.0: Comprehensive Upgrade Analysis and Practical Guide](https://medium.com/@laugh12321/performance-leap-tensorrt-yolo-6-0-comprehensive-upgrade-analysis-and-practical-guide-9d19ad3b53f9) 🌟 NEW
## ✨ Key Features
### 🎯 Diverse YOLO Support
- **Comprehensive Compatibility**: Supports the YOLOv3 through YOLO12 series as well as PP-YOLOE, PP-YOLOE+, YOLO-World, and YOLOE, meeting diverse needs. See the [🖥️ Supported Models List](#support-models) for details.
- **Flexible Switching**: Provides simple, easy-to-use interfaces for quickly switching between different YOLO versions, as the sketch after this list illustrates. 🌟 NEW
- **Multi-Scenario Applications**: Offers rich example code covering [Detect](examples/detect/), [Segment](examples/segment/), [Classify](examples/classify/), [Pose](examples/pose/), [OBB](examples/obb/), and more.
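As a minimal illustration of that switching, the sketch below assumes the Python API from the quick-start example further down; the segmentation and pose engine file names are placeholders:

```python
from tensorrt_yolo.infer import InferOption, DetectModel, SegmentModel, PoseModel

option = InferOption()
option.enable_swap_rb()

# Every task-specific model class shares the same constructor and predict()
# interface, so switching tasks or YOLO versions only means changing the
# class and the engine file (the -seg/-pose file names are placeholders):
detector = DetectModel(engine_file="yolo11n-with-plugin.engine", option=option)
segmenter = SegmentModel(engine_file="yolo11n-seg-with-plugin.engine", option=option)
pose = PoseModel(engine_file="yolo11n-pose-with-plugin.engine", option=option)
```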
### 🚀 Performance Optimization
- **CUDA Acceleration**: Optimizes pre-processing with CUDA kernels and accelerates inference with CUDA Graphs.
- **TensorRT Integration**: Deeply integrates TensorRT plugins to significantly speed up post-processing and improve overall inference efficiency.
- **Multi-Context Inference**: Supports multi-context parallel inference to maximize hardware resource utilization. 🌟 NEW
- **Memory Management Optimization**: Adopts architecture-specific memory optimization strategies (e.g., Zero Copy mode on Jetson) to improve memory efficiency. 🌟 NEW
### 🛠️ Usability
- **Out-of-the-Box**: Provides comprehensive C++ and Python inference support to meet different developers' needs.
- **CLI Tools**: Built-in command-line tools for quick model export and inference, improving development efficiency.
- **Docker Support**: Offers one-click Docker deployment solutions to simplify environment configuration and deployment processes.
- **No Third-Party Dependencies**: All functionalities are implemented using standard libraries, eliminating the need for additional dependencies and simplifying deployment.
- **Easy Deployment**: Provides dynamic library compilation support for easy calling and deployment.
### 🌐 Compatibility
- **Multi-Platform Support**: Fully compatible with various operating systems and hardware platforms, including Windows, Linux, ARM, and x86.
- **TensorRT Compatibility**: Perfectly adapts to TensorRT 10.x versions, ensuring seamless integration with the latest technology ecosystem.
### 🔧 Flexible Configuration
- **Customizable Preprocessing Parameters**: Supports flexible configuration of preprocessing parameters, including **channel swapping (SwapRB)**, **normalization parameters**, and **border padding**; see the sketch below. 🌟 NEW
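As a hedged illustration of these options in Python: `enable_swap_rb` appears in the quick-start example below, and `set_normalize_params` mirrors the commented-out line there; the border-padding setter is not shown since its exact name is not given here, so consult the API documentation.

```python
from tensorrt_yolo.infer import InferOption

option = InferOption()
option.enable_swap_rb()  # swap channels: OpenCV's BGR -> the model's expected RGB

# Per-channel mean/std normalization (e.g., for PP-YOLOE-series models);
# the values are the ImageNet statistics used in the quick-start example:
option.set_normalize_params([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])
```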
## 📊 Performance
| Model | Official + trtexec (ms) | trtyolo + trtexec (ms) | TensorRT-YOLO Inference (ms) |
|:-----:|:-----------------------:|:----------------------:|:---------------------------:|
| YOLOv11n | 1.611 ยฑ 0.061 | 1.428 ยฑ 0.097 | 1.228 ยฑ 0.048 |
| YOLOv11s | 2.055 ยฑ 0.147 | 1.886 ยฑ 0.145 | 1.687 ยฑ 0.047 |
| YOLOv11m | 3.028 ยฑ 0.167 | 2.865 ยฑ 0.235 | 2.691 ยฑ 0.085 |
| YOLOv11l | 3.856 ยฑ 0.287 | 3.682 ยฑ 0.309 | 3.571 ยฑ 0.102 |
| YOLOv11x | 6.377 ยฑ 0.487 | 6.195 ยฑ 0.482 | 6.207 ยฑ 0.231 |
> [!NOTE]
>
> **Testing Environment**
> - **GPU**: NVIDIA RTX 2080 Ti 22GB
> - **Input Size**: 640ร640 pixels
>
> **Testing Tools**
> - **Official**: Using the ONNX model exported by Ultralytics.
> - **trtyolo**: Using the CLI tool (trtyolo) provided by TensorRT-YOLO to export the ONNX model with the EfficientNMS plugin.
> - **trtexec**: Using NVIDIA's `trtexec` tool to build the ONNX model into an engine and perform inference testing.
> - **Build Command**: `trtexec --onnx=xxx.onnx --saveEngine=xxx.engine --fp16`
> - **Test Command**: `trtexec --avgRuns=1000 --useSpinWait --loadEngine=xxx.engine`
> - **TensorRT-YOLO Inference**: Using the TensorRT-YOLO framework to measure the latency (including pre-processing, inference, and post-processing) of the engine obtained through the **trtyolo + trtexec** method.
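The figures above come from `trtexec` and the framework itself. As a rough, hedged way to reproduce the end-to-end number (pre-processing + inference + post-processing) on your own hardware, a simple timing loop like the sketch below can be used; it assumes the Python quick-start API and an engine built as described above, and is not the project's official benchmark script.

```python
import time

import cv2
from tensorrt_yolo.infer import InferOption, DetectModel

option = InferOption()
option.enable_swap_rb()
model = DetectModel(engine_file="yolo11n-with-plugin.engine", option=option)
img = cv2.imread("test_image.jpg")

# Warm up so engine loading and CUDA-graph capture don't skew the numbers
for _ in range(50):
    model.predict(img)

# Time the full pipeline: pre-processing + inference + post-processing
runs = 1000
start = time.perf_counter()
for _ in range(runs):
    model.predict(img)
print(f"avg latency: {(time.perf_counter() - start) / runs * 1e3:.3f} ms")
```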
## 🔮 Documentation
- **Installation Guide**
  - [📦 Quick Compilation and Installation](docs/en/build_and_install.md)
- **Usage Examples**
  - [Object Detection Example](examples/detect/README.en.md)
  - [Instance Segmentation Example](examples/segment/README.en.md)
  - [Image Classification Example](examples/classify/README.en.md)
  - [Pose Estimation Example](examples/pose/README.en.md)
  - [Oriented Object Detection Example](examples/obb/README.en.md)
  - [📹 Video Analysis Example](examples/VideoPipe/README.en.md)
  - [Multi-threading and Multi-processing Example](examples/mutli_thread/README.en.md) 🌟 NEW
- **API Documentation**
  - Python API Documentation (⚠️ Not Implemented)
  - C++ API Documentation (⚠️ Not Implemented)
- **FAQ**
  - ⚠️ Collecting ...
- **Supported Models List**
  - [🖥️ Supported Models List](#support-models)
## 🚨 Quick Start
### 1. Prerequisites
- **CUDA**: Recommended version โฅ 11.0.1
- **TensorRT**: Recommended version โฅ 8.6.1
- **Operating System**: Linux (x86_64 or arm) (recommended); Windows is also supported
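A quick, hedged way to confirm the TensorRT side of these prerequisites is to query the version of the `tensorrt` Python bindings that ship with TensorRT:

```python
import tensorrt as trt

# The bindings report the version of the TensorRT libraries they were
# built against; this project recommends >= 8.6.1.
print(trt.__version__)
```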
### 2. Installation
- Refer to the [📦 Quick Compilation and Installation](docs/en/build_and_install.md) documentation.
### 3. Model Export
- Refer to the [🔧 Model Export](docs/en/model_export.md) documentation to export an ONNX model suitable for inference in this project and build it into a TensorRT engine.
### 4. Inference Example
> [!NOTE]
>
> `ClassifyModel`, `DetectModel`, `OBBModel`, `SegmentModel`, and `PoseModel` correspond to image classification (Classify), detection (Detect), oriented bounding box (OBB), segmentation (Segment), and pose estimation (Pose) models, respectively.
- Inference using Python:
```python
import cv2

from tensorrt_yolo.infer import InferOption, DetectModel, generate_labels, visualize


def main():
    # -------------------- Initialization --------------------
    # Configure inference settings
    option = InferOption()
    option.enable_swap_rb()  # Convert OpenCV's default BGR format to RGB

    # Special model configuration example (uncomment for PP-YOLOE series)
    # option.set_normalize_params([0.485, 0.456, 0.406], [0.229, 0.224, 0.225])

    # -------------------- Model Initialization --------------------
    # Load TensorRT engine file (ensure the path is correct)
    # Note: Initial engine loading may take longer due to optimization
    model = DetectModel(engine_file="yolo11n-with-plugin.engine",
                        option=option)

    # -------------------- Data Preprocessing --------------------
    # Load test image (with a file-existence check)
    input_img = cv2.imread("test_image.jpg")
    if input_img is None:
        raise FileNotFoundError("Failed to load test image. Check the file path.")

    # -------------------- Inference Execution --------------------
    # Perform object detection (returns bounding boxes, confidence scores, and class labels)
    detection_result = model.predict(input_img)
    print(f"==> detection_result: {detection_result}")

    # -------------------- Result Visualization --------------------
    # Load class labels (ensure labels.txt matches the model)
    class_labels = generate_labels(labels_file="labels.txt")
    # Generate visualized result
    visualized_img = visualize(
        image=input_img,
        result=detection_result,
        labels=class_labels,
    )
    cv2.imwrite("vis_image.jpg", visualized_img)

    # -------------------- Model Cloning Demo --------------------
    # Clone model instance (for multi-threaded scenarios)
    cloned_model = model.clone()  # Create an independent copy to avoid resource contention
    # Verify cloned model inference consistency
    cloned_result = cloned_model.predict(input_img)
    print(f"==> cloned_result: {cloned_result}")


if __name__ == "__main__":
    main()
```
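Building on the `clone()` call at the end of the example, a minimal multi-threaded sketch might look like the following. It assumes that cloned instances can run independently in separate threads, as the repository's multi-threading example suggests; see that example for the authoritative pattern.

```python
import threading

import cv2
from tensorrt_yolo.infer import InferOption, DetectModel

option = InferOption()
option.enable_swap_rb()
base_model = DetectModel(engine_file="yolo11n-with-plugin.engine", option=option)
image = cv2.imread("test_image.jpg")

def worker(model, idx):
    # Each thread uses its own cloned instance, avoiding resource contention
    result = model.predict(image)
    print(f"thread {idx}: {result}")

threads = [
    threading.Thread(target=worker, args=(base_model.clone(), i))
    for i in range(4)
]
for t in threads:
    t.start()
for t in threads:
    t.join()
```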
- Inference using C++:
```cpp
#include <iostream>
#include <memory>

#include <opencv2/opencv.hpp>

#include "trtyolo.hpp"

int main() {
    try {
        // -------------------- Initialization --------------------
        trtyolo::InferOption option;
        option.enableSwapRB();  // BGR -> RGB conversion

        // Special model parameter setup example
        // const std::vector<float> mean{0.485f, 0.456f, 0.406f};
        // const std::vector<float> std{0.229f, 0.224f, 0.225f};
        // option.setNormalizeParams(mean, std);

        // -------------------- Model Initialization --------------------
        auto detector = std::make_unique<trtyolo::DetectModel>(
            "yolo11n-with-plugin.engine",  // Model path
            option                         // Inference settings
        );

        // -------------------- Data Loading --------------------
        cv::Mat cv_image = cv::imread("test_image.jpg");
        if (cv_image.empty()) {
            throw std::runtime_error("Failed to load test image.");
        }

        // Wrap the image data (no pixel data is copied)
        trtyolo::Image input_image(
            cv_image.data,  // Pixel data pointer
            cv_image.cols,  // Image width
            cv_image.rows   // Image height
        );

        // -------------------- Inference Execution --------------------
        trtyolo::DetectRes result = detector->predict(input_image);
        std::cout << result << std::endl;

        // -------------------- Result Visualization (Example) --------------------
        // Implement visualization logic in actual development, e.g.:
        // cv::Mat vis_image = visualize_detections(cv_image, result);
        // cv::imwrite("vis_result.jpg", vis_image);

        // -------------------- Model Cloning Demo --------------------
        auto cloned_detector = detector->clone();  // Create an independent instance
        trtyolo::DetectRes cloned_result = cloned_detector->predict(input_image);
        // Verify result consistency
        std::cout << cloned_result << std::endl;
    } catch (const std::exception& e) {
        std::cerr << "Program Exception: " << e.what() << std::endl;
        return EXIT_FAILURE;
    }
    return EXIT_SUCCESS;
}
```
### 5. Inference Flowchart
The `predict` method covers the complete pipeline from input image to output result. Simply pass the image to be inferred to `predict`; it automatically performs preprocessing, model inference, and post-processing internally and returns the inference results, which can then be fed into downstream tasks (such as visualization or object tracking), as sketched below.
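Continuing from the Python example above, a hedged sketch of feeding the results to a downstream consumer might look like this; the `num`/`boxes`/`scores`/`classes` attribute names are assumptions, so check the actual `DetectRes` definition in the Python API.

```python
result = model.predict(input_img)

# Iterate over detections and hand each one to a downstream task,
# e.g. a tracker, a region crop, or a simple per-class counter.
# Attribute names below are assumed, not confirmed by this README.
for i in range(result.num):
    box, score, cls = result.boxes[i], result.scores[i], result.classes[i]
    print(f"class={cls} score={score:.2f} box={box}")
```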
> For more deployment examples, please refer to the [Model Deployment Examples](examples) section.
## 🖥️ Model Support List
Symbol legend: (1) ✅ : Supported; (2) ❔ : In progress; (3) ❌ : Not supported; (4) ❎ : Self-implemented export required for inference.
## 💖 Sponsorship & Support
Open-source projects thrive on support. If this project has been helpful to you, please consider sponsoring the author; your support is the greatest motivation for continued development!
---
🙏 **A Heartfelt Thank You to Our Supporters and Sponsors**:
> [!NOTE]
>
> The following is a list of sponsors automatically generated by GitHub Actions and updated daily ✨.
## 📄 License
TensorRT-YOLO is licensed under the **GPL-3.0 License**, an [OSI-approved](https://opensource.org/licenses/) open-source license that is ideal for students and enthusiasts, fostering open collaboration and knowledge sharing. Please refer to the [LICENSE](https://github.com/laugh12321/TensorRT-YOLO/blob/master/LICENSE) file for more details.
Thank you for choosing TensorRT-YOLO; we encourage open collaboration and knowledge sharing, and we hope you comply with the relevant provisions of the open-source license.
## 📞 Contact
For bug reports and feature requests regarding TensorRT-YOLO, please visit [GitHub Issues](https://github.com/laugh12321/TensorRT-YOLO/issues)!
## 🙏 Thanks