all AI news
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures
March 25, 2024, 4:47 a.m. | Agrima Seth, Sanchit Ahuja, Kalika Bali, Sunayana Sitaram
cs.CL updates on arXiv.org arxiv.org
Abstract: Generative models are increasingly being used in various applications, such as text generation, commonsense reasoning, and question-answering. To be effective globally, these models must be aware of and account for local socio-cultural contexts, making it necessary to have benchmarks to evaluate the models for their cultural familiarity. Since the training data for LLMs is web-based and the Web is limited in its representation of information, it does not capture knowledge present within communities that are …
abstract applications arxiv benchmarks cs.cl cs.cy dataset generative generative models indian making question reasoning social text text generation type
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead Data Engineer
@ WorkMoney | New York City, United States - Remote