FuseGen: PLM Fusion for Data-generation based Zero-shot Learning
June 19, 2024, 4:41 a.m. | Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang
cs.CL updates on arXiv.org | arxiv.org
Abstract: Data generation-based zero-shot learning, although effective in training Small Task-specific Models (STMs) via synthetic datasets generated by Pre-trained Language Models (PLMs), is often limited by the low quality of such synthetic datasets. Previous solutions have primarily focused on single PLM settings, where synthetic datasets are typically restricted to specific sub-spaces and often deviate from real-world distributions, leading to severe distribution bias. To mitigate such bias, we propose FuseGen, a novel data generation-based zero-shot learning framework …