all AI news
Open-ended VQA benchmarking of Vision-Language models by exploiting Classification datasets and their semantic hierarchy
Feb. 13, 2024, 5:43 a.m. | Simon Ging Mar\'ia A. Bravo Thomas Brox
cs.LG updates on arXiv.org arxiv.org
advance benchmark benchmarking benchmarks capabilities classification cs.cl cs.cv cs.lg datasets endeavor evaluation generative language language models limitations novel question question answering research semantic text understanding vision vision-language models visual
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote