[P] TensorRT-LLM Backend for WhisperS2T (~2x Speedup than CTranslate2) | allainews.com

Jan. 28, 2024, 4:49 p.m. | /u/Financial-Beach1587

Machine Learning www.reddit.com

Hey everyone! I'm excited to announce a major update to my open-source speech-to-text toolkit, WhisperS2T for the OpenAI Whisper model.

**Added TensorRT-LLM Support:**

* \~ 2x Inference Speedup: WhisperS2T now supports the TensorRT-LLM backend, achieving double the inference speed compared to the CTranslate2 backend! The current optimal configuration on an A30 GPU achieves transcription of 1-hour files in approximately 18 seconds.
* As far as I know, this is the first proper implementation of TensorRT-LLM for Whisper with batching and …

a30 backend current hey inference llm machinelearning major openai speech speech-to-text speed support tensorrt tensorrt-llm text toolkit update whisper

More from www.reddit.com / Machine Learning

How much coursework is required to land an entry-level ML job? [D] 4 hours ago | www.reddit.com

berkeley building epidemiology job +4

[D] Foundational papers for Graph Adversarial Learning? 5 hours ago | www.reddit.com

machinelearning papers understanding

[D] Suggestions for NLP Papers Commonly Implemented in ML Interviews 16 hours ago | www.reddit.com

companies implementation interview interviews +10

[D] How can attention mechanisms retrieve meaningful information over long distances when using RoPE or … 19 hours ago | www.reddit.com

attention attention mechanisms information machinelearning +3

[D] Do Lead's in an AI/DS/ML team always have PhDs, is it a requirement? 20 hours ago | www.reddit.com

hello lecture machinelearning masters +3

[D] Correct me if I'm wrong, use KL divergence for NLP, and MMD for CV. … 1 day ago | www.reddit.com

distribution divergence fields found +5

[R] New Teleoperation Tool with VisionPro 1 day, 4 hours ago | www.reddit.com

machinelearning teleoperation tool

[R] Dynamic Gaussians Mesh 1 day, 4 hours ago | www.reddit.com

dynamic machinelearning mesh

[D] ICML 2024 results 1 day, 9 hours ago | www.reddit.com

conference current decisions discussions +11

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Engineer - AWS

@ 3Pillar Global | Costa Rica

View on ai-jobs.net

Cost Controller/ Data Analyst - India

@ John Cockerill | Mumbai, India, India, India

View on ai-jobs.net