Topic: supervised fine-tuning
D2PO: Discriminator-Guided DPO with Response Evaluation Models
2 weeks, 3 days ago | arxiv.org
Homonym Sense Disambiguation in the Georgian Language
2 weeks, 3 days ago | arxiv.org
Supervised Fine-tuning in turn Improves Visual Foundation Models
1 month, 1 week ago | arxiv.org
Positive Unlabeled Contrastive Learning
1 month, 2 weeks ago | arxiv.org
Reference-free Monolithic Preference Optimization with Odds Ratio
2 months, 1 week ago | arxiv.org
What is Supervised Fine-Tuning?
2 months, 2 weeks ago | www.youtube.com
From Supervised Fine-Tuning to Online Feedback
2 months, 3 weeks ago | gradientflow.com
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
2 months, 4 weeks ago | arxiv.org
Constrained Decoding for Cross-lingual Label Projection
3 months, 2 weeks ago | arxiv.org
Items published with this topic over the last 90 days.