May 2, 2022, 1:11 a.m. | Alexander R. Fabbri, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong

cs.CL updates on arXiv.org arxiv.org

Factual consistency is an essential quality of text summarization models in
practical settings. Existing work on evaluating this dimension falls broadly
into two lines of research: entailment-based metrics and question answering
(QA)-based metrics. Different experimental setups often lead to contrasting
conclusions as to which paradigm performs best. In this work, we conduct an
extensive comparison of entailment-based and QA-based metrics, demonstrating that
carefully choosing the components of a QA-based metric, especially question
generation and answerability classification, is …

