May 27, 2023, 1:25 p.m. | /u/kkimdev

Machine Learning | www.reddit.com

There has been a lot of distillation research and application on BERT and its variants, so I was wondering: why don't we see much distillation research on GPT-3-scale LLMs?

Can anyone familiar with LLM distillation share some insights? Thanks in advance!
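For context, the BERT-era work the question refers to (e.g. DistilBERT) mostly uses soft-label knowledge distillation in the style of Hinton et al. (2015). A minimal sketch in PyTorch of that loss, assuming teacher and student produce logits over the same label or vocabulary space; the function name and default hyperparameters here are illustrative, not from any particular paper:

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Soft-label knowledge distillation loss (Hinton et al., 2015 style).

    Blends KL divergence between temperature-softened teacher and student
    distributions with ordinary cross-entropy on the hard labels.
    """
    # Soft targets: KL(teacher || student) at temperature T.
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard cross-entropy against ground-truth class indices.
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard
```

For an autoregressive model the same loss is applied per token position, with the distribution taken over the full vocabulary.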
