all AI news
Batch Active Learning of Reward Functions from Human Preferences
Feb. 27, 2024, 5:41 a.m. | Erdem B{\i}y{\i}k, Nima Anari, Dorsa Sadigh
cs.LG updates on arXiv.org arxiv.org
Abstract: Data generation and labeling are often expensive in robot learning. Preference-based learning is a concept that enables reliable labeling by querying users with preference questions. Active querying methods are commonly employed in preference-based learning to generate more informative data at the expense of parallelization and computation time. In this paper, we develop a set of novel algorithms, batch active preference-based learning methods, that enable efficient learning of reward functions using as few data samples as …
abstract active learning arxiv computation concept cs.ai cs.lg cs.ro data functions generate human labeling parallelization questions robot stat.ml type
More from arxiv.org / cs.LG updates on arXiv.org
Testing the Segment Anything Model on radiology data
2 days, 7 hours ago |
arxiv.org
Calorimeter shower superresolution
2 days, 7 hours ago |
arxiv.org
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US