Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation

April 8, 2024, 4:44 a.m. | Ji-Jia Wu, Andy Chia-Hao Chang, Chieh-Yu Chuang, Chun-Pei Chen, Yu-Lun Liu, Min-Hung Chen, Hou-Ning Hu, Yung-Yu Chuang, Yen-Yu Lin

cs.CV updates on arXiv.org

arXiv:2404.04231v1
Abstract: This paper addresses text-supervised semantic segmentation, aiming to learn a model capable of segmenting arbitrary visual concepts within images by using only image-text pairs without dense annotations. Existing methods have demonstrated that contrastive learning on image-text pairs effectively aligns visual segments with the meanings of texts. We notice that there is a discrepancy between text alignment and semantic segmentation: A text often consists of multiple semantic concepts, whereas semantic segmentation strives to create semantically homogeneous …
