Boter: Bootstrapping Knowledge Selection and Question Answering for Knowledge-based VQA
April 23, 2024, 4:47 a.m. | Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu
Source: cs.CV updates on arXiv.org
Abstract: Knowledge-based Visual Question Answering (VQA) requires models to incorporate external knowledge to respond to questions about visual content. Previous methods mostly follow the "retrieve and generate" paradigm. Initially, they utilize a pre-trained retriever to fetch relevant knowledge documents, subsequently employing them to generate answers. While these methods have demonstrated commendable performance in the task, they possess limitations: (1) they employ an independent retriever to acquire knowledge solely based on the similarity between the query and …