July 5, 2022, 1:12 a.m. | Keon Lee, Kyumin Park, Daeyoung Kim

cs.CL updates on arXiv.org arxiv.org

The majority of current TTS datasets, which are collections of individual
utterances, contain few conversational aspects in terms of both style and
metadata. In this paper, we introduce DailyTalk, a high-quality conversational
speech dataset designed for Text-to-Speech. We sampled, modified, and recorded
2,541 dialogues from the open-domain dialogue dataset DailyDialog which are
adequately long to represent context of each dialogue. During the data
construction step, we maintained attributes distribution originally annotated
in DailyDialog to support diverse dialogue in DailyTalk. On …

arxiv conversational dataset speech text text-to-speech

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Data Science Sustainability Co-Op (Summer & Fall 2024)

@ O-I | Perrysburg, OH, United States

Research Scientist

@ Chevron Phillips Chemical Company | USA: Kingwood, TX, US, 77339

Data Scientist Python (Django) (m/f/d)

@ RoomPriceGenie | Hybrid Mannheim, Remote DACH, Remote Germany

Operational Transformation & Strategy - Data Operations - Associate

@ JPMorgan Chase & Co. | Mumbai, Maharashtra, India

Senior Data Scientist

@ Rocket Travel | Chicago, IL USA