Dec. 30, 2023, 10:30 a.m. | code_your_own_AI


Live test of MAMBA 2.8B, fine-tuned and DPO-aligned. The real-world performance of MAMBA 2.8B ZEPHYR (SFT + DPO) is tested live on several tasks, including maths and logical reasoning.

All rights remain with the authors:
https://huggingface.co/xiuyul
https://huggingface.co/xiuyul/mamba-2.8b-zephyr
This model is a fine-tuned version of xiuyul/mamba-2.8b-ultrachat, trained on the HuggingFaceH4/ultrafeedback_binarized dataset using Direct Preference Optimization (DPO).
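
For readers new to DPO, here is a minimal sketch of the per-pair loss it optimizes. It assumes you already have the summed log-probabilities of the chosen and rejected responses under both the model being trained and a frozen reference model. The function name dpo_loss, its argument names, and the value beta=0.1 are illustrative assumptions, not taken from the model card or its training code.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss from summed sequence log-probabilities.

    beta=0.1 is an assumed illustrative value, not the setting used
    for the model discussed here.
    """
    # How much more likely each response is under the policy
    # than under the frozen reference model (in log space).
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin: positive when the policy favors the
    # chosen response more strongly than the reference does.
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(margin)), computed as softplus(-margin) in a
    # numerically stable way for both signs of margin.
    if margin > 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Policy identical to the reference: the loss equals log(2).
    print(dpo_loss(-10.0, -12.0, -10.0, -12.0))
    # Policy shifted toward the chosen response: the loss drops below log(2).
    print(dpo_loss(-10.0, -12.0, -11.0, -11.0))
```

The loss shrinks as the policy raises the chosen response's probability relative to the reference and lowers the rejected one's, which is how a preference dataset such as ultrafeedback_binarized can steer the model without a separate reward model.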

For further details (MAMBA code implementation) see my Community tab.

#ai
#aieducation
#airesearch
