Aug. 11, 2022, 1:12 a.m. | Youyuan Zhang, Jiuniu Wang, Hao Wu, Wenjia Xu

cs.CV updates on arXiv.org

Image captioning models are usually trained on human-annotated ground-truth
captions, which tends to produce accurate but generic captions. In this paper,
we focus on generating distinctive captions that can distinguish the target
image from other similar images. To evaluate the distinctiveness of captions,
we introduce a series of metrics that use the large-scale vision-language
pre-training model CLIP to quantify distinctiveness. To further improve the
distinctiveness of captioning models, we propose a simple and effective
training strategy that trains …
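The excerpt does not spell out the metrics, but a plausible sketch of a CLIP-style distinctiveness score is the caption's similarity to the target image minus its mean similarity to a set of similar "distractor" images. The function below assumes precomputed embeddings (in practice these would come from CLIP's image and text encoders); the names and toy vectors are illustrative, not from the paper:

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def distinctiveness(caption_emb, target_emb, distractor_embs):
    """Score = sim(caption, target) - mean sim(caption, distractors).

    A higher score means the caption is more specific to the target
    image and less applicable to visually similar images.
    """
    target_sim = cosine(caption_emb, target_emb)
    distractor_sim = np.mean([cosine(caption_emb, d) for d in distractor_embs])
    return target_sim - distractor_sim

# Toy 3-d vectors standing in for CLIP embeddings.
caption = np.array([1.0, 0.0, 0.0])
target = np.array([0.9, 0.1, 0.0])
distractors = [np.array([0.0, 1.0, 0.0]), np.array([0.0, 0.9, 0.1])]
score = distinctiveness(caption, target, distractors)
```

A generic caption would score near zero here, since it matches the distractors about as well as the target; a distinctive caption scores well above zero.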

