Sept. 21, 2023, 4:19 p.m. | /u/Successful-Western27

Artificial Intelligence www.reddit.com

Edit: FLAC is the tested audio extension, not MP3

I read the new paper from DeepMind so you don't have to. Here are the key highlights:

* Despite training on text, **language models compressed images 43% better than PNG, and audio nearly 2x better than FLAC.**
* Confirmation of scaling laws - **bigger models compressed better.** But model size must match dataset size.
* There are **tradeoffs between model scale, data size, and compression** performance. More data enables bigger models. …
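For anyone wondering how a predictive model can act as a compressor at all: a rough sketch, assuming the standard link between prediction and compression (an ideal arithmetic coder spends about `-log2 p` bits on a symbol the model predicted with probability `p`). The tiny adaptive byte-frequency model below is only a stand-in I made up for illustration, not the paper's setup, and the function names are hypothetical.

```python
import math
from collections import Counter


def ideal_code_length_bits(data: bytes) -> float:
    """Bits an ideal arithmetic coder would need when driven by a
    toy adaptive order-0 byte model (a stand-in for a real predictor)."""
    counts = Counter()
    seen = 0
    total_bits = 0.0
    for byte in data:
        # Laplace-smoothed probability of this byte given the bytes seen so far.
        p = (counts[byte] + 1) / (seen + 256)
        # Better predictions (higher p) cost fewer bits.
        total_bits += -math.log2(p)
        counts[byte] += 1
        seen += 1
    return total_bits


def compression_ratio(data: bytes) -> float:
    """Modelled size divided by raw size; below 1.0 means the data shrank."""
    if not data:
        return 0.0
    return ideal_code_length_bits(data) / (8 * len(data))


if __name__ == "__main__":
    print(compression_ratio(b"a" * 1000))       # highly predictable input
    print(compression_ratio(bytes(range(256))))  # every byte is a surprise
```

The better the model predicts the next symbol, the smaller the total, which is the intuition behind a stronger predictor giving a shorter encoding.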
