Feb. 14, 2024, 10:50 p.m. | /u/SaladChefs

Machine Learning www.reddit.com

A while ago, we shared our [Whisper Large v2 benchmark](https://www.reddit.com/r/MachineLearning/comments/16ftd9v/p_whisper_large_benchmark_137_days_of_audio/) in this community and there was considerable interest and discussion around it.

Here's the follow-up: **Whisper Large v3 benchmark.**

**The Result: 1 Million hours of audio transcribed on consumer GPUs for just $5110.**

That's around **11,736 mins per dollar** \- 10X more than our Whisper Large v2 benchmark (1681 mins per dollar).

A 99.8% cost savings compared to managed transcription services.

## Deployment

We created a container group with **100 …

audio benchmark consumer cost cost savings deployment gpus machinelearning managed per services transcription transcription services whisper

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne