all AI news
Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering. (arXiv:2204.10448v1 [cs.CV])
cs.LG updates on arXiv.org arxiv.org
Knowledge-based visual question answering (QA) aims to answer a question
which requires visually-grounded external knowledge beyond image content
itself. Answering complex questions that require multi-hop reasoning under weak
supervision is considered as a challenging problem since i) no supervision is
given to the reasoning process and ii) high-order semantics of multi-hop
knowledge facts need to be captured. In this paper, we introduce a concept of
hypergraph to encode high-level semantics of a question and a knowledge base,
and to learn …
arxiv cv hypergraph knowledge question answering reasoning transformer weakly-supervised