[D] [R] Are there any methods/works that enable extracting high-quality dense feature map from CLIP/OpenCLIP image encoders without large scale finetuning? | allainews.com

May 5, 2024, 3:41 a.m. | /u/Tensor_Devourer_56

Machine Learning www.reddit.com

Hi, as stated in the title, I'm curious if such methods exist. We know that (trained) CLIP's image and text encoders both output an 1D vector that are aligned in the latent space, which allows to easily compute the similarities between a batch of images and texts. However, in many vision applications, it is desirable to get a 3D feature map of shape C\*H\*W. Ideally, if the vector at each spatial location in this feature map is as high-quality as …

clip compute feature finetuning image machinelearning map quality scale space text vector

More from www.reddit.com / Machine Learning

[R] Grounding DINO 1.5 Release: the most capable open-set detection model 5 hours ago | www.reddit.com

building dataset detection foundation +12

[D] Foundational Time Series Models Overrated? 6 hours ago | www.reddit.com

chronos domain etc example +13

[R] Do Llamas Work in English? On the Latent Language of Multilingual Transformers 6 hours ago | www.reddit.com

abstract bias colab english +19

[R] Robust agents learn causal world models 6 hours ago | www.reddit.com

abstract agent agents biases +14

[D] Library for named entity recognition 7 hours ago | www.reddit.com

library machinelearning mean recognition +3

[N] ICML 2024 Workshop on making discrete operations differentiable 🤖 8 hours ago | www.reddit.com

clustering deep learning differentiable everything +12

[P] GPT-Burn: A simple & concise implementation of the GPT in pure Rust 🔥 13 hours ago | www.reddit.com

gpt implementation machinelearning rust +1

[R] 1:10 Radio Controlled Car autonomous driving 18 hours ago | www.reddit.com

advice autonomous autonomous driving cameras +13

[P] How to keep only the top 10K most common tokens (transformers library) 21 hours ago | www.reddit.com

huggingface machinelearning tokens

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net