Aug. 11, 2022, 2:57 p.m. | /u/jamescalam

Natural Language Processing www.reddit.com

Hi all, I created a [walkthrough](https://towardsdatascience.com/quick-fire-guide-to-multi-modal-ml-with-openais-clip-2dad7e398ac0?sk=89bb2d8b8e583ed109d8a05e00366645) (and [video](https://youtu.be/989aKUVBfbk)) demoing how to use the text and image embeddings of OpenAI's CLIP. CLIP is a multi-modal model that uses a typical text transformer for text embeddings and a vision transformer (ViT; an alternate version uses a ResNet) for image embeddings. During pretraining, CLIP learns to place matching (image, text) pairs close together in a shared vector space. The result is a cool off-the-shelf model that can perform tasks across image and text data.

When I started using …
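For anyone who wants to try it before reading the full walkthrough, here's a minimal sketch of pulling both embeddings with the Hugging Face `transformers` port of CLIP. I'm assuming the standard `openai/clip-vit-base-patch32` checkpoint (the article may use a different setup), and `dog.jpg` is just a placeholder for any local image:

```python
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

# Standard ViT-B/32 CLIP checkpoint (an assumption; swap in your own).
model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

image = Image.open("dog.jpg")  # placeholder path, any local image works
texts = ["a photo of a dog", "a photo of a cat"]

inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)
with torch.no_grad():
    outputs = model(**inputs)

# Because pretraining pushed matching pairs together, both embeddings
# live in the same 512-d space and can be compared directly.
image_emb = outputs.image_embeds  # shape: (1, 512)
text_emb = outputs.text_embeds    # shape: (2, 512)

# Normalize, then cosine similarity across modalities.
image_emb = image_emb / image_emb.norm(dim=-1, keepdim=True)
text_emb = text_emb / text_emb.norm(dim=-1, keepdim=True)
print(image_emb @ text_emb.T)  # one score per caption
```

The cross-modal similarity at the end is the whole trick: the same dot product works for zero-shot classification, text-to-image search, or image-to-image search, depending on which embeddings you index.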

clip image languagetechnology ml openai text text-image
