Topic: human feedback
Self-Instruct Framework, Explained
2 weeks, 3 days ago | towardsdatascience.com
MetaRM: Shifted Distributions Alignment via Meta-Learning
2 weeks, 4 days ago | arxiv.org
A Survey of Reinforcement Learning from Human Feedback
2 weeks, 5 days ago | arxiv.org
Contrastive Preference Learning: Learning from Human Feedback without RL
2 weeks, 5 days ago | arxiv.org
Optimal Design for Human Feedback
3 weeks, 6 days ago | arxiv.org
High-Dimension Human Value Representation in Large Language Models
1 month, 1 week ago | arxiv.org
Removing RLHF Protections in GPT-4 via Fine-Tuning
1 month, 1 week ago | arxiv.org
Learning from Little Human Feedback [R] [P]
1 month, 2 weeks ago | www.reddit.com
Confidence-aware Reward Optimization for Fine-tuning Text-to-Image Models
1 month, 2 weeks ago | arxiv.org
Items published with this topic over the last 90 days.