Jan. 28, 2024, 4:49 p.m. | /u/Financial-Beach1587

Machine Learning www.reddit.com

Hey everyone! I'm excited to announce a major update to WhisperS2T, my open-source speech-to-text toolkit for the OpenAI Whisper model.

**Added TensorRT-LLM Support:**

* ~2x Inference Speedup: WhisperS2T now supports the TensorRT-LLM backend, which roughly doubles inference speed compared to the CTranslate2 backend! The current best configuration on an A30 GPU transcribes a 1-hour file in approximately 18 seconds.
* As far as I know, this is the first proper implementation of TensorRT-LLM for Whisper with batching and …
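
For a rough sense of scale, the figures quoted above can be turned into a real-time speedup. This is only back-of-the-envelope arithmetic, not a benchmark: the helper name `real_time_speedup` is made up for illustration, and the CTranslate2 time is an inference that assumes the "~2x" figure applies to the same 1-hour file on the same A30 GPU, which the post does not state explicitly.

```python
def real_time_speedup(audio_seconds: float, processing_seconds: float) -> float:
    """Seconds of audio transcribed per second of processing (hypothetical helper)."""
    if processing_seconds <= 0:
        raise ValueError("processing_seconds must be positive")
    return audio_seconds / processing_seconds


audio_seconds = 60 * 60   # the 1-hour file quoted in the post
trt_llm_seconds = 18      # approximate TensorRT-LLM time on an A30, as quoted

speedup = real_time_speedup(audio_seconds, trt_llm_seconds)

# Real-time factor: processing time divided by audio duration (lower is faster).
rtf = trt_llm_seconds / audio_seconds

# ASSUMPTION: "~2x faster than CTranslate2" holds for this same file and GPU.
implied_ct2_seconds = trt_llm_seconds * 2

print(f"TensorRT-LLM: about {speedup:.0f}x faster than real time (RTF {rtf:.3f})")
print(f"Implied CTranslate2 time under the assumption: about {implied_ct2_seconds} s")
```

Under those assumptions, 18 seconds for an hour of audio works out to about 200x faster than real time, and the CTranslate2 backend would take roughly 36 seconds for the same file.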
