May 15, 2023, 12:47 a.m. | Ruixiang Jiang, Lingbo Liu, Changwen Chen

cs.CV updates on arXiv.org arxiv.org

Recent advances in visual-language models have shown remarkable zero-shot
text-image matching ability that is transferable to down-stream tasks such as
object detection and segmentation. However, adapting these models for object
counting, which involves estimating the number of objects in an image, remains
a formidable challenge. In this study, we conduct the first exploration of
transferring visual-language models for class-agnostic object counting.
Specifically, we propose CLIP-Count, a novel pipeline that estimates density
maps for open-vocabulary objects with text guidance in a …

arxiv challenge clip count detection image language language models objects segmentation study text text-image

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US