Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning
April 22, 2024, 4:42 a.m. | Colin Bellinger, Mark Crowley, Isaac Tamblyn
cs.LG updates on arXiv.org arxiv.org
Abstract: Reinforcement learning (RL) has been shown to learn sophisticated control policies for complex tasks including games, robotics, heating and cooling systems and text generation. The action-perception cycle in RL, however, generally assumes that a measurement of the state of the environment is available at each time step without a cost. In applications such as materials design, deep-sea and planetary robot exploration and medicine, however, there can be a high cost associated with measuring, or even …