March 26, 2024, 4:51 a.m. | Shinka Mori, Oana Ignat, Andrew Lee, Rada Mihalcea

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.16909v1 Announce Type: cross
Abstract: Synthetic data generation has the potential to impact applications and domains with scarce data. However, before such data is used for sensitive tasks such as mental health, we need an understanding of how different demographics are represented in it. In our paper, we analyze the potential of producing synthetic data using GPT-3 by exploring the various stressors it attributes to different race and gender combinations, to provide insight for future researchers looking into using LLMs …

abstract applications arxiv cs.ai cs.cl cs.cy data demographics domains fidelity generated health however human impact mental health paper representation synthetic synthetic data tasks type understanding

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Machine Learning Engineer (AI, NLP, LLM, Generative AI)

@ Palo Alto Networks | Santa Clara, CA, United States

Consultant Senior Data Engineer F/H

@ Devoteam | Nantes, France