Sept. 29, 2022, 1:11 a.m. | Tobias Hallmen, Silvan Mertes, Dominik Schiller, Elisabeth André

cs.LG updates on arXiv.org

Affective speech analysis is an ongoing topic of research. A relatively new
problem in this field is the analysis of vocal bursts, which are nonverbal
vocalisations such as laughs or sighs. Current state-of-the-art approaches to
address affective vocal burst analysis are mostly based on wav2vec2 or HuBERT
features. In this paper, we investigate the use of the wav2vec successor
data2vec in combination with a multitask learning pipeline to tackle different
analysis problems at once. To assess the performance of our …

