all AI news
OpenAI transcribed over a million hours of YouTube videos to train its LLMs, Google engaged in same practice
April 8, 2024, 10:16 a.m. | Rob Thubron
TechSpot www.techspot.com
In order to access more reputable English language-based text on the internet in 2021, OpenAI researchers created a speech recognition tool called Whisper, writes The New York Times. It was designed to transcribe audio from YouTube videos, giving the company a trove of data to train its LLMs.
Read Entire Article
audio english english language giving google in 2021 internet language llms openai practice recognition researchers speech speech recognition text the company the new york times tool train transcribe videos whisper youtube
More from www.techspot.com / TechSpot
Jobs in AI, ML, Big Data
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York