[Research] [Project] Text-to-Audio Generation using Instruction-Tuned LLM and Latent Diffusion Model | allainews.com

May 4, 2023, 4:44 p.m. | /u/bideex

Machine Learning www.reddit.com

Paper: [https://arxiv.org/abs/2304.13731](https://arxiv.org/abs/2304.13731)

Code: [https://github.com/declare-lab/tango](https://github.com/declare-lab/tango)

Demo: [https://huggingface.co/spaces/declare-lab/tango](https://huggingface.co/spaces/declare-lab/tango)

Project: [https://tango-web.github.io/](https://tango-web.github.io/)

Abstract: The immense scale of the recent large language models (LLM) allows many interesting properties, such as, instruction- and chain-of-thought-based fine-tuning, that has significantly improved zero- and few-shot performance in many natural language processing (NLP) tasks. Inspired by such successes, we adopt such an instruction-tuned LLM FLAN-T5 as the text encoder for text-to audio (TTA) generation—a task where the goal is to generate an audio from its textual description. The prior works …

abstract audio encoder fine-tuning language language models language processing large language models llm machinelearning natural natural language natural language processing nlp performance processing scale text thought

More from www.reddit.com / Machine Learning

[N] Kaiming He's lecture on DL architecture for Representation Learning 5 hours ago | www.reddit.com

advances architecture good lecture +3

Do you think Reinforcement Learning still got it? [D] 9 hours ago | www.reddit.com

alphago architectures big computer +15

[P] TorchFix - a linter for PyTorch-using code with autofix support 11 hours ago | www.reddit.com

machinelearning

[D] Is Google Set to Dominate the RAG Scene with Its Massive Data Resources? 13 hours ago | www.reddit.com

basic big data google +16

[P] AI-based Language Teacher that can run locally on a 12GB graphics card (RTX 4070) 15 hours ago | www.reddit.com

application card fun graphics +7

[D] Embeddings search "drowning" in a sea of noise! Can you solve this riddle? 16 hours ago | www.reddit.com

application concept dimensions embeddings +15

Any ways to improve TabNet..??? [D] 22 hours ago | www.reddit.com

machinelearning

[R] Machine learning from 3D meshes and physical fields 22 hours ago | www.reddit.com

machinelearning

[Discussion] Are there specific technical/scientific breakthroughs that have allowed the significant jump in maximum context … 23 hours ago | www.reddit.com

claude context gpt gpt-4 +14

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Analyst

@ SEAKR Engineering | Englewood, CO, United States

View on ai-jobs.net

Data Analyst II

@ Postman | Bengaluru, India

View on ai-jobs.net

Data Architect

@ FORSEVEN | Warwick, GB

View on ai-jobs.net

Director, Data Science

@ Visa | Washington, DC, United States

View on ai-jobs.net

Senior Manager, Data Science - Emerging ML

@ Capital One | McLean, VA

View on ai-jobs.net