all AI news
MAMBA 2.8B ZEPHYR Fine-Tuned + DPO-Aligned: TEST
Dec. 30, 2023, 10:30 a.m. | code_your_own_AI
code_your_own_AI www.youtube.com
All rights with Authors:
https://huggingface.co/xiuyul
https://huggingface.co/xiuyul/mamba-2.8b-zephyr
.. this is a fine-tuned version of xiuyul/mamba-2.8b-ultrachat on the HuggingFaceH4/ultrafeedback_binarized dataset trained using Direct Preference Optimization (DPO).
For further details (MAMBA code implementation) see my Community tab.
#ai
#aieducation
#airesearch
authors dataset direct preference optimization mamba maths optimization performance reasoning rights sft tasks test world zephyr
More from www.youtube.com / code_your_own_AI
Understand DSPy: Programming AI Pipelines
1 day, 16 hours ago |
www.youtube.com
Latest Insights in AI Performance Models
3 days, 16 hours ago |
www.youtube.com
New Discovery: Retrieval Heads for Long Context
5 days, 16 hours ago |
www.youtube.com
Multi-Token Prediction (forget next token LLM?)
6 days, 16 hours ago |
www.youtube.com
LLMs: Rewriting Our Tomorrow (plus code) #ai
1 week, 2 days ago |
www.youtube.com
Autonomous AI Agents: 14 % MAX Performance
1 week, 3 days ago |
www.youtube.com
480B LLM as 128x4B MoE? WHY?
1 week, 5 days ago |
www.youtube.com
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US