Oct. 7, 2022, 4:46 p.m. | /u/valdanylchuk

Machine Learning www.reddit.com

LM-based; in contrast to other recent audio generation experiments which worked from transcribed text or midi notes, AudioLM works directly based on the audio signal, resulting in outstanding consistency and high fidelity sound.

Google blog post from yesterday: [https://ai.googleblog.com/2022/10/audiolm-language-modeling-approach-to.html](https://ai.googleblog.com/2022/10/audiolm-language-modeling-approach-to.html)

Demo clip on Youtube: [https://www.youtube.com/watch?v=\_xkZwJ0H9IU](https://www.youtube.com/watch?v=_xkZwJ0H9IU)

Paper: [https://arxiv.org/abs/2209.03143](https://arxiv.org/abs/2209.03143)

Abstract:

>We introduce AudioLM, a framework for high-quality audio generation with long-term consistency. AudioLM maps the input audio to a sequence of discrete tokens and casts audio generation as a language modeling task in …

audiolm google machinelearning quality voice

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

AI Engineering Manager

@ M47 Labs | Barcelona, Catalunya [Cataluña], Spain