Dec. 17, 2023, 8:49 a.m. | /u/Financial-Beach1587

Machine Learning www.reddit.com

The growing buzz around optimized speech-to-text pipelines for the OpenAI Whisper model sparked my drive to unveil this open-source side project - a lightning-fast speeich-to-text pipeline tailored for the whisper model! It's ~1.5X times faster than WhisperX and ~2X times faster than the HuggingFace Pipeline with FlashAttention 2 (Insanely Fast Whisper) on an A30 GPU. WhisperS2T also includes several heuristics to improve transcription accuracies. 🚀🗣️

🔗 Dive into the code on GitHub: https://github.com/shashikg/WhisperS2T

📝 Stay tuned! I'm gearing up to …

drive faster huggingface lightning machinelearning openai pipeline pipelines project speech speech-to-text text whisper

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US