Topic: moe
WEIRD AI News (An Honest Take!)
2 weeks, 2 days ago | www.youtube.com
480B LLM as 128x4B MoE? WHY?
2 weeks, 2 days ago | www.youtube.com
Snowflake trains 480B MoE in $2 million 🔥
2 weeks, 3 days ago | unwindai.substack.com
Snowflake Touts Speed, Efficiency of New ‘Arctic’ LLM
2 weeks, 4 days ago | www.datanami.com
[N] Snowflake releases open (Apache 2.0) 128x3B MoE model
2 weeks, 4 days ago | www.reddit.com
[D] Are there any MoE models other than LLMs?
2 weeks, 5 days ago | www.reddit.com
Era of Hyper-Real AI Videos is here 🤯
3 weeks, 3 days ago | unwindai.substack.com
Jamba: A Hybrid Transformer-Mamba Language Model
1 month, 1 week ago | arxiv.org
[D] What's your go-to simple MoE training code project?
1 month, 1 week ago | www.reddit.com
JAMBA MoE: Open Source MAMBA w/ Transformer: CODE
1 month, 1 week ago | www.youtube.com
DBRX: MOST POWERFUL Open Source LLM - NEW @Databricks
1 month, 2 weeks ago | www.youtube.com
[D] I don't understand how backprop works on sparsely gated MoE
1 month, 3 weeks ago | www.reddit.com
Applying Mixture of Experts in LLM Architectures
1 month, 4 weeks ago | developer.nvidia.com
Items published with this topic over the last 90 days.