April 5, 2024, 4:45 a.m. | Izumi Fujimori, Masaki Oono, Masami Shishibori

cs.CV updates on arXiv.org

arXiv:2404.03394v1 Announce Type: new
Abstract: In weakly-supervised semantic segmentation (WSSS) using only image-level class labels, a problem with CNN-based Class Activation Maps (CAM) is that they tend to activate the most discriminative local regions of objects. On the other hand, methods based on Transformers learn global features but suffer from the issue of background noise contamination. This paper focuses on addressing the issue of background noise in attention weights within the existing WSSS method based on Conformer, known as TransCAM. …

