Feb. 14, 2024, 1:43 p.m. | /u/ivan_kudryavtsev

Computer Vision www.reddit.com

We created a benchmark comparing three approaches to serving the YOLOv8m model (640x640 input, batch size 1):

* PyTorch CUDA + OpenCV;
* PyTorch CUDA + Torchaudio (hardware decoding with NVDEC);
* Savant (TensorRT, hardware decoding with NVDEC).

Savant delivered three times the throughput of naive PyTorch CUDA + OpenCV and more than twice that of PyTorch CUDA + Torchaudio. The numbers were measured on a GeForce RTX 2080 with an Intel Core i5-8600K CPU @ 3.60GHz and 32 GB RAM.

|**Benchmark**|**FPS**|**Improvement**|
|:-|:-|:-|
|PyTorch CUDA + OpenCV|75|0.294| …
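
For anyone who wants to reproduce FPS numbers like these on their own hardware, here is a minimal sketch of a throughput harness. It is not the code used for the benchmark above: `measure_fps`, `process_frame`, and the `warmup` default are hypothetical names and values chosen for illustration, and the frame source is whatever decoder you plug in.

```python
import time
from typing import Any, Callable, Iterable


def measure_fps(
    process_frame: Callable[[Any], Any],
    frames: Iterable[Any],
    warmup: int = 10,
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Return frames per second for process_frame over frames.

    The first `warmup` frames are processed but not timed, so one-off
    costs (model loading, kernel compilation, cache fills) do not skew
    the result. Hypothetical helper, not part of any library.
    """
    it = iter(frames)

    # Untimed warm-up phase.
    for _ in range(warmup):
        try:
            frame = next(it)
        except StopIteration:
            break
        process_frame(frame)

    # Timed phase: count frames and measure wall-clock time.
    count = 0
    start = clock()
    for frame in it:
        # For GPU inference, process_frame should block until the result
        # is ready (e.g. synchronize the device), otherwise only the
        # kernel launch time is measured.
        process_frame(frame)
        count += 1
    elapsed = clock() - start

    if count == 0 or elapsed <= 0:
        return 0.0
    return count / elapsed


if __name__ == "__main__":
    # Dummy workload standing in for decode + inference.
    fps = measure_fps(lambda f: sum(range(10_000)), range(1_000))
    print(f"{fps:.1f} FPS")
```

The `clock` parameter is only there so the timer can be swapped out in tests. In a real run, decoding should happen inside the timed loop, since the decoding path is one of the things the three setups differ in.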

