Aug. 19, 2023, 10:23 a.m. | Aneesh Tickoo

MarkTechPost www.marktechpost.com

Multiple machine learning applications, including text, vision, and audio, have seen rapid and significant developments in the technology of generative models. The industry and society have felt significant effects of these developments. Notably, generative models with multi-modal input have become a truly innovative development. Zero-shot text-to-speech (TTS) is a well-known speech generation problem in the […]


The post Microsoft Researchers Introduce SpeechX: A Versatile Speech Generation Model Capable of Zero-Shot TTS and Various Speech Transformation Tasks appeared first on MarkTechPost …

ai shorts applications artificial intelligence audio become editors pick effects generative generative models industry language model large language model machine machine learning machine learning applications microsoft multi-modal multiple researchers society speech speech generation staff tasks tech news technology text transformation tts vision

More from www.marktechpost.com / MarkTechPost

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US