Feb. 20, 2024, 5:43 a.m. | Nineli Lashkarashvili, Wen Wu, Guangzhi Sun, Philip C. Woodland

cs.LG updates on arXiv.org

arXiv:2402.11747v1 Announce Type: cross
Abstract: Foundation models have shown superior performance for speech emotion recognition (SER). However, given the limited data in emotion corpora, finetuning all parameters of large pre-trained models for SER can be both resource-intensive and susceptible to overfitting. This paper investigates parameter-efficient finetuning (PEFT) for SER. Various PEFT adaptors are systematically studied for both classification of discrete emotion categories and prediction of dimensional emotional attributes. The results demonstrate that the combination of PEFT methods surpasses full finetuning …
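
As a rough illustration of the PEFT idea described in the abstract (a generic LoRA-style sketch, not the paper's specific adaptor designs or hyperparameters), the snippet below freezes a pre-trained encoder and trains only small low-rank matrices plus a classification head. The stand-in encoder, rank, learning rate, and emotion classes are all hypothetical placeholders; a real setup would wrap the attention projections of a speech foundation model such as wav2vec 2.0 or HuBERT.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen pre-trained linear layer plus a trainable low-rank update:
    y = W x + (alpha / r) * B A x, where only A and B are trained."""
    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False            # freeze pre-trained weights
        self.lora_a = nn.Linear(base.in_features, r, bias=False)
        self.lora_b = nn.Linear(r, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)     # update starts at zero, so the
        self.scale = alpha / r                 # wrapped layer is unchanged at init

    def forward(self, x):
        return self.base(x) + self.scale * self.lora_b(self.lora_a(x))

# Hypothetical stand-in for a frozen speech foundation model encoder.
encoder = nn.Sequential(nn.Linear(768, 768), nn.GELU(), nn.Linear(768, 768))
for p in encoder.parameters():
    p.requires_grad = False

# Inject LoRA adaptors into the linear layers.
encoder[0] = LoRALinear(encoder[0])
encoder[2] = LoRALinear(encoder[2])

num_emotions = 4                               # e.g. angry / happy / neutral / sad
classifier = nn.Linear(768, num_emotions)      # trainable task head

# Only the low-rank matrices and the head are optimized.
params = list(encoder.parameters()) + list(classifier.parameters())
optimizer = torch.optim.AdamW([p for p in params if p.requires_grad], lr=1e-4)

x = torch.randn(2, 768)                        # placeholder pooled speech features
logits = classifier(encoder(x))                # shape: (batch, num_emotions)
```

For the paper's other task, prediction of dimensional emotional attributes (e.g. valence and arousal), the same frozen backbone could swap the classification head for a regression layer.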
