all AI news
Flash-Decoding for long-context inference
Oct. 12, 2023, 5:59 p.m. | Tri Dao, Daniel Haziza, Francisco Massa, Grigory Sizov
Blog Content - TOGETHER www.together.xyz
attention during inference, bringing up to 8x faster generation for very
long sequences.
More from www.together.xyz / Blog Content - TOGETHER
Flash-Decoding for long-context inference
7 months, 1 week ago |
www.together.xyz
Faster inference enables up to 5x price reduction on Together API
9 months, 1 week ago |
www.together.xyz
Monarch Mixer: A new model architecture for increased efficiency
9 months, 3 weeks ago |
www.together.xyz
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US