Aug. 18, 2022, 1:10 a.m. | Zhihua Jin, Xingbo Wang, Furui Cheng, Chunhui Sun, Qun Liu, Huamin Qu

cs.LG updates on arXiv.org

Benchmark datasets play an important role in evaluating Natural Language
Understanding (NLU) models. However, shortcuts -- unwanted biases in
benchmark datasets -- can undermine their effectiveness in revealing models'
real capabilities. Since shortcuts vary in coverage, productivity, and
semantic meaning, it is challenging for NLU experts to systematically
understand and avoid them when creating benchmark datasets. In this paper,
we develop a visual analytics system, ShortcutLens, to help NLU experts
explore shortcuts in NLU benchmark datasets. …
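To make the abstract's "coverage" and "productivity" concrete, here is a minimal sketch of how such statistics could be computed for a token-level shortcut candidate. The definitions used below (coverage = fraction of examples containing the token; productivity = fraction of covered examples carrying the shortcut's most frequent label) are common in the shortcut-learning literature but are an assumption here, not taken from the ShortcutLens paper; the `shortcut_stats` function and the toy data are likewise hypothetical.

```python
from collections import Counter

def shortcut_stats(examples, token):
    """Coverage and productivity of a candidate token-level shortcut.

    Assumed illustrative definitions (not from the paper):
    - coverage: fraction of all examples whose text contains the token
    - productivity: fraction of covered examples carrying the
      most frequent label among covered examples
    """
    covered = [(text, label) for text, label in examples
               if token in text.split()]
    if not covered:
        return 0.0, 0.0
    coverage = len(covered) / len(examples)
    label_counts = Counter(label for _, label in covered)
    productivity = label_counts.most_common(1)[0][1] / len(covered)
    return coverage, productivity

# Toy NLI-style data where "not" spuriously correlates with "contradiction".
data = [
    ("the cat is not here", "contradiction"),
    ("he did not agree", "contradiction"),
    ("she was not late", "entailment"),
    ("the dog runs fast", "entailment"),
]
cov, prod = shortcut_stats(data, "not")
# cov = 3/4 = 0.75; prod = 2/3 (two of three covered examples
# are labeled "contradiction")
```

A high-coverage, high-productivity token like this is exactly the kind of spurious cue a model can exploit without any real understanding, which is why such patterns need systematic inspection.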

Tags: analytics, arxiv, dataset, language, natural language understanding, visual analytics
