Feb. 19, 2024, 4:41 p.m. | /u/Kaldnite

r/MachineLearning (www.reddit.com)

I've been doing some reading about Mixture of Experts (MoE) models, and how they add a penalty to the model to ensure that the distribution of activations is roughly even across all X experts.
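For context, that penalty is usually an auxiliary load-balancing loss on the router, in the style of the Switch Transformer. Here is a minimal sketch of that idea; the function name, shapes, and use of NumPy are my own illustrative assumptions, not any particular library's API:

```python
import numpy as np

def load_balancing_loss(router_logits, num_experts):
    """Switch-Transformer-style auxiliary loss (illustrative sketch).

    router_logits: (num_tokens, num_experts) raw gating scores.
    Returns a scalar that is minimised when tokens are spread
    evenly across the experts.
    """
    # Softmax over experts gives each token's routing probabilities.
    probs = np.exp(router_logits - router_logits.max(axis=-1, keepdims=True))
    probs /= probs.sum(axis=-1, keepdims=True)

    # f_i: fraction of tokens whose top-1 choice is expert i.
    top1 = probs.argmax(axis=-1)
    f = np.bincount(top1, minlength=num_experts) / len(top1)

    # P_i: mean routing probability assigned to expert i.
    P = probs.mean(axis=0)

    # num_experts * sum_i f_i * P_i equals 1 at perfect balance and
    # grows when a few experts soak up most of the traffic.
    return num_experts * np.sum(f * P)
```

This term is added (with a small weight) to the usual language-modelling loss, so the router is nudged toward spreading tokens across experts rather than collapsing onto one or two of them.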

Now that being said, is it reasonable to say that it isn't a case of "this is the Maths expert, and that's the Science expert", but rather a black box of optimised sub-models trained on lots of training data to target different dimensions of the input query?
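That reading fits how routing actually works: a small learned gating network scores every token and sends it to its top-k experts, with no domain labels anywhere in the process. A rough sketch of per-token top-k routing, again under my own naming and shape assumptions rather than any specific implementation:

```python
import numpy as np

def route_tokens(token_embeddings, router_weights, k=2):
    """Pick top-k experts per token from learned gating scores (sketch).

    token_embeddings: (num_tokens, d_model)
    router_weights:   (d_model, num_experts) learned linear router.
    Returns expert indices (num_tokens, k) and their combination weights.
    """
    logits = token_embeddings @ router_weights            # (tokens, experts)
    topk_idx = np.argsort(logits, axis=-1)[:, -k:]        # best k experts per token
    topk_logits = np.take_along_axis(logits, topk_idx, axis=-1)

    # Renormalise only over the chosen experts; their outputs are then
    # mixed with these weights. Nothing here knows about "maths" or
    # "science" -- any specialisation is whatever the optimiser finds.
    topk_weights = np.exp(topk_logits - topk_logits.max(axis=-1, keepdims=True))
    topk_weights /= topk_weights.sum(axis=-1, keepdims=True)
    return topk_idx, topk_weights
```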

I'm viewing it more as a …

