all AI news
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning
April 22, 2024, 4:42 a.m. | Colin Bellinger, Mark Crowley, Isaac Tamblyn
cs.LG updates on arXiv.org arxiv.org
Abstract: Reinforcement learning (RL) has been shown to learn sophisticated control policies for complex tasks including games, robotics, heating and cooling systems and text generation. The action-perception cycle in RL, however, generally assumes that a measurement of the state of the environment is available at each time step without a cost. In applications such as materials design, deep-sea and planetary robot exploration and medicine, however, there can be a high cost associated with measuring, or even …
abstract arxiv control cooling cost cs.ai cs.lg dynamic environment games however learn measurement observation perception policies reinforcement reinforcement learning robotics state systems tasks text text generation the environment type
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Data Engineer (m/f/d)
@ Project A Ventures | Berlin, Germany
Principle Research Scientist
@ Analog Devices | US, MA, Boston