all AI news
Tri Dao, Stanford: On FlashAttention and sparsity, quantization, and efficient inference
Feb. 12, 2024, 8:21 p.m. | /u/thejashGI
Deep Learning www.reddit.com
Some topics covered:
* Taking a contrarian bet on recurrent connections over attention
* Using data augmentation to encode knowledge into models
* Designing algorithms that take advantage of hardware
Listen to the conversation:
* [Spotify](https://open.spotify.com/show/1hikWa5LWDQJwXtz5LoeVn)
* [Apple Podcasts](https://podcasts.apple.com/us/podcast/generally-intelligent/id1544921720)
* [Pocket Casts](https://pca.st/ewh266dr)
* [Highlights and referenced papers](https://imbue.com/podcast/2024-02-08-podcast-episode-33-tri-dao/)
algorithms attention augmentation conversation data deeplearning designing encode hardware knowledge the conversation topics
More from www.reddit.com / Deep Learning
Classification of images with numerical "continous" categories
1 day, 15 hours ago |
www.reddit.com
How does gradient descent work in random forest
2 days, 7 hours ago |
www.reddit.com
Prerequisites for jumping into transformers?
2 days, 9 hours ago |
www.reddit.com
[Reading] Deeplearning by goodfellow
2 days, 15 hours ago |
www.reddit.com
Linearizing Large Language Models
3 days, 7 hours ago |
www.reddit.com
Converting Soft tokens to Hard tokens in Llama2
3 days, 9 hours ago |
www.reddit.com
Detection of free parking spaces
3 days, 16 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US