all AI news
Learning Distinct and Representative Modes for Image Captioning. (arXiv:2209.08231v1 [cs.CV])
Sept. 20, 2022, 1:12 a.m. | Qi Chen, Chaorui Deng, Qi Wu
cs.CV updates on arXiv.org arxiv.org
Over the years, state-of-the-art (SoTA) image captioning methods have
achieved promising results on some evaluation metrics (e.g., CIDEr). However,
recent findings show that the captions generated by these methods tend to be
biased toward the "average" caption that only captures the most general mode
(a.k.a, language pattern) in the training corpus, i.e., the so-called mode
collapse problem. Affected by it, the generated captions are limited in
diversity and usually less informative than natural image descriptions made by
humans. In this …
More from arxiv.org / cs.CV updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Program Control Data Analyst
@ Ford Motor Company | Mexico
Vice President, Business Intelligence / Data & Analytics
@ AlphaSense | Remote - United States