April 12, 2024, 2:55 a.m. | /u/IllustriousSir_007 | r/machinelearningnews

Can pre-computed embeddings obtained from the teacher model be used to train the student model in knowledge distillation?

This project extends CLIP for efficient knowledge distillation by using pre-computed embeddings as teachers. Typical knowledge distillation frameworks require running forward passes through the teacher model, which is often prohibitive when the teacher has billions or trillions of parameters. Using only the teacher's embeddings to guide distillation can yield significant computational savings. A minimal sketch of the idea is shown below.

GitHub: https://github.com/lnairGT/CLIP-Distillation
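To make the idea concrete, here is a minimal sketch (not the repo's actual API) of embedding-based distillation: the teacher embeddings are computed once offline, cached to disk, and the student is trained to match them, so the teacher is never run during training. `StudentEncoder`, the file names, and the hyperparameters are all hypothetical.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torch.utils.data import DataLoader, TensorDataset

# Assumed to be pre-computed offline once with the teacher (e.g., a CLIP image encoder):
# images.pt        -> (N, 3, 224, 224) preprocessed images
# teacher_embs.pt  -> (N, D) teacher embeddings for those images
images = torch.load("images.pt")
teacher_embs = torch.load("teacher_embs.pt")
loader = DataLoader(TensorDataset(images, teacher_embs), batch_size=256, shuffle=True)

class StudentEncoder(nn.Module):
    """Small stand-in encoder; in practice this is the compact student model."""
    def __init__(self, embed_dim):
        super().__init__()
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.proj = nn.Linear(64, embed_dim)

    def forward(self, x):
        return self.proj(self.backbone(x))

student = StudentEncoder(embed_dim=teacher_embs.shape[1])
opt = torch.optim.AdamW(student.parameters(), lr=1e-4)

for epoch in range(10):
    for imgs, t_emb in loader:
        s_emb = F.normalize(student(imgs), dim=-1)
        t_emb = F.normalize(t_emb, dim=-1)
        # Match the teacher's embedding space: minimize 1 - cosine similarity.
        # No teacher forward pass happens anywhere in this loop.
        loss = (1 - (s_emb * t_emb).sum(dim=-1)).mean()
        opt.zero_grad()
        loss.backward()
        opt.step()
```

The one-time cost of caching the embeddings replaces a teacher forward pass on every training step, which is where the savings come from with very large teachers; the actual project may use a different loss or student architecture.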

