http://arxiv.org/abs/2209.08199

Sept. 20, 2022 | Yu-Chung Hsiao, Fedir Zubach, Maria Wang, Jindong (JD) Chen

cs.CL updates on arXiv.org

We present a new task and dataset, ScreenQA, for screen content understanding
via question answering. The existing screen datasets are focused either on
structure and component-level understanding, or on much higher-level
composite tasks such as navigation and task completion. We attempt to bridge the
gap between these two by annotating 80,000+ question-answer pairs over the RICO
dataset, in the hope of benchmarking screen reading comprehension capacity.

