Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization | allainews.com

April 16, 2024, 4:51 a.m. | Navonil Majumder, Chia-Yu Hung, Deepanway Ghosal, Wei-Ning Hsu, Rada Mihalcea, Soujanya Poria

cs.CL updates on arXiv.org arxiv.org

arXiv:2404.09956v1 Announce Type: cross
Abstract: Generative multimodal content is increasingly prevalent in much of the content creation arena, as it has the potential to allow artists and media personnel to create pre-production mockups by quickly bringing their ideas to life. The generation of audio from text prompts is an important aspect of such processes in the music and film industry. Many of the recent diffusion-based text-to-audio models focus on training increasingly sophisticated diffusion models on a large set of datasets …

abstract arena artists arxiv audio create cs.ai cs.cl cs.sd diffusion direct preference optimization eess.as generative ideas life media multimodal multimodal content optimization production prompts text through type

More from arxiv.org / cs.CL updates on arXiv.org

UMass-BioNLP at MEDIQA-M3G 2024: DermPrompt -- A Systematic Exploration of Prompt Engineering with GPT-4V for … 15 hours ago | arxiv.org

abstract arxiv capabilities cases +18

Temporal Dynamics of Coordinated Online Behavior: Stability, Archetypes, and Influence 15 hours ago | arxiv.org

abstract art arxiv behavior +12

Multilingual large language models leak human stereotypes across language boundaries 15 hours ago | arxiv.org

abstract arxiv biases cs.cl +18

SlimPajama-DC: Understanding Data Combinations for LLM Training 15 hours ago | arxiv.org

arxiv cs.ai cs.cl data +5

IFDID: Information Filter upon Diversity-Improved Decoding for Diversity-Faithfulness Tradeoff in NLG 15 hours ago | arxiv.org

abstract arxiv cs.cl decoding +19

Exploring the Potential of Human-LLM Synergy in Advancing Qualitative Analysis: A Case Study on Mental-Illness … 15 hours ago | arxiv.org

abstract analysis arxiv case +22

Beyond Prompts: Learning from Human Communication for Enhanced AI Intent Alignment 15 hours ago | arxiv.org

abstract ai systems alignment arxiv +15

Can We Use Large Language Models to Fill Relevance Judgment Holes? 15 hours ago | arxiv.org

abstract arxiv build collection +13

One vs. Many: Comprehending Accurate Information from Multiple Erroneous and Inconsistent AI Generations 15 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +13

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net