Aug. 19, 2023, 10:23 a.m. | Aneesh Tickoo

MarkTechPost www.marktechpost.com

Generative models have advanced rapidly across multiple machine learning domains, including text, vision, and audio, and these advances have had significant effects on industry and society. Generative models with multi-modal input stand out as a notably innovative development. Zero-shot text-to-speech (TTS) is a well-known speech generation problem in the […]


The post Microsoft Researchers Introduce SpeechX: A Versatile Speech Generation Model Capable of Zero-Shot TTS and Various Speech Transformation Tasks appeared first on MarkTechPost …

