Web: http://arxiv.org/abs/2205.05050

May 11, 2022, 1:11 a.m. | Arshdeep Sekhon, Yangfeng Ji, Matthew B. Dwyer, Yanjun Qi

cs.CL updates on arXiv.org

Recent literature has seen growing interest in using black-box strategies
like CheckList for testing the behavior of NLP models. Research on white-box
testing has developed a number of methods for evaluating how thoroughly the
internal behavior of deep models is tested, but these methods are not applicable
to NLP models. We propose a set of white-box testing methods that are customized
for transformer-based NLP models. These include Mask Neuron Coverage (MNCOVER),
which measures how thoroughly the attention layers in models are …


