all AI news
Topic: softmax
Learn how to implement the softmax function in python!
1 month, 3 weeks ago |
dev.to
Transformers as Support Vector Machines
2 months, 2 weeks ago |
arxiv.org
Gated Linear Attention Transformers with Hardware-Efficient Training
2 months, 2 weeks ago |
arxiv.org
Reusing Softmax Hardware Unit for GELU Computation in Transformers
2 months, 3 weeks ago |
arxiv.org
Knowledge Distillation Under Ideal Joint Classifier Assumption
2 months, 3 weeks ago |
arxiv.org
A Common Misconception About Cross Entropy Loss
2 months, 3 weeks ago |
www.reddit.com
Superiority of Multi-Head Attention in In-Context Linear Regression
3 months, 1 week ago |
arxiv.org
[D] Initializing a Small LLM to Reflect Natural Token Distribution
3 months, 1 week ago |
www.reddit.com
LLM Transformers 101 (Part 5 of 5): Linear Transformation & Softmax
3 months, 3 weeks ago |
www.youtube.com
Is Everyone in data science a mathematician
4 months, 2 weeks ago |
www.reddit.com
[D] High-temperature softmax
6 months, 2 weeks ago |
www.reddit.com
CoViz - A Visual Deep Learning Framework built with WebGPU 🔥
10 months, 4 weeks ago |
www.reddit.com
Good references for tempered softmax?
11 months, 4 weeks ago |
www.reddit.com
Items published with this topic over the last 90 days.
Latest
Learn how to implement the softmax function in python!
1 month, 3 weeks ago |
dev.to
Transformers as Support Vector Machines
2 months, 2 weeks ago |
arxiv.org
Gated Linear Attention Transformers with Hardware-Efficient Training
2 months, 2 weeks ago |
arxiv.org
Reusing Softmax Hardware Unit for GELU Computation in Transformers
2 months, 3 weeks ago |
arxiv.org
Knowledge Distillation Under Ideal Joint Classifier Assumption
2 months, 3 weeks ago |
arxiv.org
A Common Misconception About Cross Entropy Loss
2 months, 3 weeks ago |
www.reddit.com
Superiority of Multi-Head Attention in In-Context Linear Regression
3 months, 1 week ago |
arxiv.org
[D] Initializing a Small LLM to Reflect Natural Token Distribution
3 months, 1 week ago |
www.reddit.com
LLM Transformers 101 (Part 5 of 5): Linear Transformation & Softmax
3 months, 3 weeks ago |
www.youtube.com
Is Everyone in data science a mathematician
4 months, 2 weeks ago |
www.reddit.com
[D] High-temperature softmax
6 months, 2 weeks ago |
www.reddit.com
CoViz - A Visual Deep Learning Framework built with WebGPU 🔥
10 months, 4 weeks ago |
www.reddit.com
Good references for tempered softmax?
11 months, 4 weeks ago |
www.reddit.com
Topic trend (last 90 days)
Top (last 7 days)
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US