Feb. 26, 2024, 9:44 a.m. | /u/Basic_AI

Computer Vision www.reddit.com

Models like CLIP wowed us by responding seamlessly to text prompts without any training samples. But CLIP's weak spatial representations make dense prediction tasks like image segmentation tough without extensive fine-tuning, which can dampen that zero-shot flair. Self-supervised models like DINO, by contrast, learn robust spatial representations without relying on labels.

Bringing these strengths together, the new CLIP-DINOiser framework fuses DINO’s self-supervised image features with CLIP’s zero-shot classifier to pull off zero-shot segmentation that can hold its own against fully-supervised approaches. …
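The core recipe can be illustrated in a few lines: compare dense, DINO-style patch features against CLIP-style per-class text embeddings and label each patch with its best match. This is a minimal numpy sketch of that idea under assumed shapes, not the actual CLIP-DINOiser implementation; the function name and toy data are illustrative.

```python
import numpy as np

def zero_shot_segment(patch_feats, text_embs):
    """Assign each image patch to the class whose text embedding is
    most similar (cosine similarity), giving a coarse zero-shot
    segmentation map.

    patch_feats: (H, W, D) dense image features (DINO-style)
    text_embs:   (C, D) per-class text embeddings (CLIP-style)
    returns:     (H, W) class index per patch
    """
    # L2-normalize both sides so dot products equal cosine similarities
    p = patch_feats / np.linalg.norm(patch_feats, axis=-1, keepdims=True)
    t = text_embs / np.linalg.norm(text_embs, axis=-1, keepdims=True)
    sims = p @ t.T                 # (H, W, C) similarity per patch/class
    return sims.argmax(axis=-1)    # pick the best-matching class

# Toy example: a 4x4 patch grid with 8-dim features and 3 classes.
rng = np.random.default_rng(0)
text_embs = rng.normal(size=(3, 8))
# Every patch copies class 1's embedding, so all patches map to class 1.
patch_feats = np.tile(text_embs[1], (4, 4, 1))
seg = zero_shot_segment(patch_feats, text_embs)
```

The real framework adds a DINO-guided refinement of CLIP's dense features rather than this direct argmax, but the similarity-based labeling above is the zero-shot mechanism that makes text-prompted segmentation possible without training samples.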

