all AI news
ChroniclingAmericaQA: A Large-scale Question Answering Dataset based on Historical American Newspaper Pages
March 27, 2024, 4:48 a.m. | Bhawna Piryani, Jamshid Mozafari, Adam Jatowt
cs.CL updates on arXiv.org arxiv.org
Abstract: Question answering (QA) and Machine Reading Comprehension (MRC) tasks have significantly advanced in recent years due to the rapid development of deep learning techniques and, more recently, large language models. At the same time, many benchmark datasets have become available for QA and MRC tasks. However, most existing large-scale benchmark datasets have been created predominantly using synchronous document collections like Wikipedia or the Web. Archival document collections, such as historical newspapers, contain valuable information from …
abstract advanced arxiv become benchmark cs.cl dataset datasets deep learning deep learning techniques development language language models large language large language models machine question question answering reading scale tasks type
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
DevOps Engineer (Data Team)
@ Reward Gateway | Sofia/Plovdiv