June 15, 2022, 1:10 a.m. | Andreas Triantafyllopoulos, Meishu Song, Zijiang Yang, Xin Jing, Björn W. Schuller

cs.LG updates on arXiv.org

In this work, we explore a novel few-shot personalisation architecture for
emotional vocalisation prediction. The core contribution is an 'enrolment'
encoder which utilises two unlabelled samples of the target speaker to adjust
the output of the emotion encoder; the adjustment is based on dot-product
attention, thus effectively functioning as a form of 'soft' feature selection.
The emotion and enrolment encoders are based on two standard audio
architectures: CNN14 and CNN10. The two encoders are further guided to forget
or learn …
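
To make the attention-based adjustment concrete, here is a minimal sketch in
plain Python. It is one possible reading of the abstract, not the authors'
implementation: the abstract does not say how the two enrolment embeddings are
combined or how the gate is formed, so the sigmoid gate, the scaling by the
square root of the dimension, and the names `attend` and `personalise` are all
assumptions made for illustration. The toy 4-dimensional vectors stand in for
CNN14/CNN10 outputs.

```python
import math


def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [x / total for x in exps]


def attend(query, keys):
    """Scaled dot-product attention; the keys also serve as the values."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k in keys])
    context = [
        sum(w * k[i] for w, k in zip(weights, keys))
        for i in range(len(query))
    ]
    return weights, context


def personalise(emotion_emb, enrolment_embs):
    """Adjust the emotion embedding using the enrolment embeddings.

    ASSUMPTION: the attention context is squashed into a (0, 1) gate that
    rescales each emotion feature, i.e. a 'soft' feature selection.
    """
    weights, context = attend(emotion_emb, enrolment_embs)
    gate = [1.0 / (1.0 + math.exp(-c)) for c in context]
    adjusted = [g * e for g, e in zip(gate, emotion_emb)]
    return adjusted, gate, weights


# Toy stand-ins for encoder outputs (hypothetical values).
emotion_emb = [0.5, -1.0, 2.0, 0.0]
enrolment_embs = [
    [1.0, 0.0, 3.0, -2.0],   # first unlabelled sample of the target speaker
    [0.0, 1.0, -1.0, 2.0],   # second unlabelled sample
]

adjusted, gate, weights = personalise(emotion_emb, enrolment_embs)
print("attention weights:", weights)
print("feature gate:", gate)
print("adjusted embedding:", adjusted)
```

Because every gate value lies strictly between 0 and 1, each feature of the
emotion embedding is attenuated rather than hard-masked, which is the sense in
which such a mechanism acts as soft rather than hard feature selection.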

