Topic: sft
[D] Is EOS token crucial during pre-training?
2 days, 16 hours ago | www.reddit.com
Supervised Fine-tuning in turn Improves Visual Foundation Models
3 weeks, 6 days ago | arxiv.org
JAMBA MoE: Open Source MAMBA w/ Transformer: CODE
1 month, 1 week ago | www.youtube.com
ORPO: NEW DPO Alignment and SFT Method for LLM
1 month, 2 weeks ago | www.youtube.com
Reference-free Monolithic Preference Optimization with Odds Ratio
1 month, 3 weeks ago | arxiv.org
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
2 months, 2 weeks ago | arxiv.org
Meet తెలుగు Llama
2 months, 3 weeks ago | analyticsindiamag.com
NEW Code for SFT and DPO Training: Unsloth LLama
3 months, 2 weeks ago | www.youtube.com
MAMBA 2.8B ZEPHYR Fine-Tuned + DPO-Aligned: TEST
4 months, 1 week ago | www.youtube.com
Zephyr 7B beta - How much does DPO really help?
6 months, 1 week ago | www.youtube.com
Train MISTRAL 7B to outperform LLama 2 70B (ZEPHYR)
6 months, 3 weeks ago | www.youtube.com
This is 🔥 AI News explained for NERDS!!!
6 months, 3 weeks ago | www.youtube.com