Web: http://arxiv.org/abs/2106.00808

Sept. 23, 2022, 1:13 a.m. | Sorawit Saengkyongam, Nikolaj Thams, Jonas Peters, Niklas Pfister

stat.ML updates on arXiv.org arxiv.org

Contextual bandit and reinforcement learning algorithms have been
successfully used in various interactive learning systems such as online
advertising, recommender systems, and dynamic pricing. However, they have yet
to be widely adopted in high-stakes application domains, such as healthcare.
One reason may be that existing approaches assume that the underlying
mechanisms are static in the sense that they do not change over different
environments. In many real-world systems, however, the mechanisms are subject
to shifts across environments which may invalidate …

arxiv perspective policy

More from arxiv.org / stat.ML updates on arXiv.org

Research Scientists

@ ODU Research Foundation | Norfolk, Virginia

Embedded Systems Engineer (Robotics)

@ Neo Cybernetica | Bedford, New Hampshire

2023 Luis J. Alvarez and Admiral Grace M. Hopper Postdoc Fellowship in Computing Sciences

@ Lawrence Berkeley National Lab | San Francisco, CA

Senior Manager Data Scientist

@ NAV | Remote, US

Senior AI Research Scientist

@ Earth Species Project | Remote anywhere

Research Fellow- Center for Security and Emerging Technology (Multiple Opportunities)

@ University of California Davis | Washington, DC

Staff Fellow - Data Scientist

@ U.S. FDA/Center for Devices and Radiological Health | Silver Spring, Maryland

Staff Fellow - Senior Data Engineer

@ U.S. FDA/Center for Devices and Radiological Health | Silver Spring, Maryland

Machine Learning Data Engineer Intern (Jyoti Dharna)

@ Benson Hill | St. Louis, Missouri

Software Engineer / SDE I, Chime SDK Video Research Engineering

@ Amazon.com | East Palo Alto, California, USA

IND (New) Senior ML Ops Engineer - WiQ

@ Quantium | Hyderabad

Data Engineer

@ LendingTree | Remote