Topic: sft
[D] Is EOS token crucial during pre-training?
1 week, 6 days ago |
www.reddit.com
Supervised Fine-tuning in turn Improves Visual Foundation Models
1 month, 1 week ago |
arxiv.org
JAMBA MoE: Open Source MAMBA w/ Transformer: CODE
1 month, 3 weeks ago |
www.youtube.com
ORPO: NEW DPO Alignment and SFT Method for LLM
1 month, 3 weeks ago |
www.youtube.com
Reference-free Monolithic Preference Optimization with Odds Ratio
2 months, 1 week ago |
arxiv.org
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
2 months, 4 weeks ago |
arxiv.org
NEW Code for SFT and DPO Training: Unsloth LLama
3 months, 3 weeks ago |
www.youtube.com
MAMBA 2.8B ZEPHYR Fine-Tuned + DPO-Aligned: TEST
4 months, 2 weeks ago |
www.youtube.com
Zephyr 7B beta - How much does DPO really help?
6 months, 2 weeks ago |
www.youtube.com
This is 🔥 AI News explained for NERDS!!!
7 months ago |
www.youtube.com