MAMBA 2.8B ZEPHYR Fine-Tuned + DPO-Aligned: TEST | allainews.com

Dec. 30, 2023, 10:30 a.m. | code_your_own_AI

code_your_own_AI www.youtube.com

A live test of MAMBA 2.8B ZEPHYR, a MAMBA 2.8B model that was supervised fine-tuned (SFT) and then aligned with DPO. Its real-world performance is tested live on several tasks, including maths and logical reasoning.

All rights remain with the authors:
https://huggingface.co/xiuyul
https://huggingface.co/xiuyul/mamba-2.8b-zephyr
This model is a fine-tuned version of xiuyul/mamba-2.8b-ultrachat, trained with Direct Preference Optimization (DPO) on the HuggingFaceH4/ultrafeedback_binarized dataset.
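
For readers new to DPO, here is a minimal sketch of the per-pair loss it optimizes, written in plain Python. It is not taken from the model's training code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions for illustration. Each argument is the summed log-probability that the trained policy or the frozen reference model assigns to the preferred ("chosen") or dispreferred ("rejected") answer.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for a single preference pair (illustrative helper).

    beta scales how strongly the policy is pushed away from the
    reference model; 0.1 is an assumed value, not the one used
    to train mamba-2.8b-zephyr.
    """
    # How much more the policy favors the chosen answer over the rejected one.
    policy_margin = policy_chosen_logp - policy_rejected_logp
    # The same margin under the frozen reference model.
    ref_margin = ref_chosen_logp - ref_rejected_logp
    logits = beta * (policy_margin - ref_margin)
    # loss = -log(sigmoid(logits)) = log(1 + exp(-logits)), computed stably.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))


# The policy prefers the chosen answer more than the reference does,
# so the loss falls below log(2), the value at zero margin.
print(dpo_loss(-10.0, -20.0, -15.0, -15.0))
```

The loss shrinks as the policy widens its preference for the chosen answer relative to the reference model, and grows when it favors the rejected answer instead.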

For further details, including the MAMBA code implementation, see my Community tab.

#ai
#aieducation
#airesearch

