April 25, 2024, 10 p.m. | /u/Prudent_Student2839

Machine Learning www.reddit.com

My friend implemented the method of Multihead Mixture of Experts in this arxiv paper [https://arxiv.org/pdf/2404.15045](https://arxiv.org/pdf/2404.15045) and he wanted me to share it with you!

https://github.com/lhallee/Multi_Head_Mixture_of_Experts__MH-MOE

Try it out. Let me know what you think and I will pass it on to him.

machinelearning think will

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne