Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It | allainews.com

April 24, 2024, 4:42 a.m. | Yuta Saito, Masahiro Nomura

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.15084v1 Announce Type: new
Abstract: There has been a growing interest in off-policy evaluation in the literature such as recommender systems and personalized medicine. We have so far seen significant progress in developing estimators aimed at accurately estimating the effectiveness of counterfactual policies based on biased logged data. However, there are many cases where those estimators are used not only to evaluate the value of decision making policies but also to search for the best hyperparameters from a large candidate …

abstract arxiv counterfactual cs.lg deal evaluation hyperparameter literature medicine optimization personalized policies policy progress recommender systems systems type

More from arxiv.org / cs.LG updates on arXiv.org

DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning 23 hours ago | arxiv.org

abstract agents arxiv benchmark +20

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation 23 hours ago | arxiv.org

abstract ai models arxiv beyond +27

Enabling Accelerators for Graph Computing 23 hours ago | arxiv.org

abstract accelerators applications arxiv +24

DUCK: Distance-based Unlearning via Centroid Kinematics 23 hours ago | arxiv.org

abstract acquired artificial artificial intelligence +16

Motion Informed Needle Segmentation in Ultrasound Images 23 hours ago | arxiv.org

abstract arxiv availability cs.cv +10

A ripple in time: a discontinuity in American history 23 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +13

An algorithm for forensic toolmark comparisons 23 hours ago | arxiv.org

abstract algorithm analysis arxiv +12

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models 23 hours ago | arxiv.org

arxiv characters consistent cs.cv +9

On Linear Separation Capacity of Self-Supervised Representation Learning 23 hours ago | arxiv.org

abstract adept advances arxiv +17

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net