Oct. 10, 2023, 2 p.m. | Anthony Alford

InfoQ - AI, ML & Data Engineering www.infoq.com

Harmonai, the audio research lab of Stability AI, has released Stable Audio, a diffusion model for text-controlled audio generation. Stable Audio was trained on 19,500 hours of audio data and can generate 44.1 kHz audio in real time on a single NVIDIA A100 GPU.


