Learning Distinct and Representative Modes for Image Captioning. (arXiv:2209.08231v1 [cs.CV]) | allainews.com

Sept. 20, 2022, 1:12 a.m. | Qi Chen, Chaorui Deng, Qi Wu

cs.CV updates on arXiv.org arxiv.org

Over the years, state-of-the-art (SoTA) image captioning methods have
achieved promising results on some evaluation metrics (e.g., CIDEr). However,
recent findings show that the captions generated by these methods tend to be
biased toward the "average" caption that only captures the most general mode
(a.k.a, language pattern) in the training corpus, i.e., the so-called mode
collapse problem. Affected by it, the generated captions are limited in
diversity and usually less informative than natural image descriptions made by
humans. In this …

arxiv captioning image

More from arxiv.org / cs.CV updates on arXiv.org

Pix2HDR -- A pixel-wise acquisition and deep learning-based synthesis approach for high-speed HDR videos 21 hours ago | arxiv.org

abstract acquisition applications arxiv +16

LuViRA Dataset Validation and Discussion: Comparing Vision, Radio, and Audio Sensors for Indoor Localization 21 hours ago | arxiv.org

abstract algorithms analysis arxiv +17

Unsupervised Representation Learning for 3D MRI Super Resolution with Degradation Adaptation 21 hours ago | arxiv.org

abstract arxiv cs.cv deep learning +16

Accurate Spatial Gene Expression Prediction by integrating Multi-resolution features 21 hours ago | arxiv.org

abstract analysis arxiv costs +17

TIP-Editor: An Accurate 3D Editor Following Both Text-Prompts And Image-Prompts 21 hours ago | arxiv.org

abstract arxiv attention control +10

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs 21 hours ago | arxiv.org

abstract arxiv capabilities clip +21

EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS 21 hours ago | arxiv.org

arxiv cs.cv cs.gr type

FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation 21 hours ago | arxiv.org

arxiv cs.cv cs.ro lidar +4

A Systematic Review of Deep Learning-based Research on Radiology Report Generation 21 hours ago | arxiv.org

abstract arxiv automation clinical +18

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Program Control Data Analyst

@ Ford Motor Company | Mexico

View on ai-jobs.net

Vice President, Business Intelligence / Data & Analytics

@ AlphaSense | Remote - United States

View on ai-jobs.net