Dec. 14, 2023, 11:03 p.m. | Sana Hassan

MarkTechPost www.marktechpost.com

Researchers from Alibaba Group introduced Qwen-Audio, which addresses the challenge of limited pre-trained audio models for diverse tasks. A hierarchical tag-based multi-task framework is designed to avoid interference issues from co-training. Qwen-Audio achieves impressive performance across benchmark tasks without task-specific fine-tuning. Qwen-Audio-Chat, built upon Qwen-Audio, supports multi-turn dialogues and diverse audio-central scenarios, demonstrating its universal […]


The post Alibaba Researchers Introduce Qwen-Audio Series: A Set of Large-Scale Audio-Language Models with Universal Audio Understanding Abilities appeared first on MarkTechPost.

ai shorts alibaba alibaba group applications artificial intelligence audio benchmark challenge chat diverse editors pick fine-tuning framework hierarchical language language model language models large language model machine learning performance qwen researchers scale series set staff tag tasks tech news technology training understanding

More from www.marktechpost.com / MarkTechPost

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US