April 21, 2024, 1 a.m. | /u/sgpfc

Machine Learning www.reddit.com

Automatically generated pairwise preference data and DPO alignment improve text-to-audio generation.

Code and dataset: [https://github.com/declare-lab/tango](https://github.com/declare-lab/tango)

Paper: [https://arxiv.org/abs/2404.09956](https://arxiv.org/abs/2404.09956)

Model: [https://huggingface.co/declare-lab/tango2](https://huggingface.co/declare-lab/tango2)

alignment audio audio generation data diffusion direct preference optimization dpo generated machinelearning optimization text through

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne