April 22, 2024, 4:42 a.m. | Colin Bellinger, Mark Crowley, Isaac Tamblyn

cs.LG updates on arXiv.org arxiv.org

arXiv:2307.02620v3 Announce Type: replace
Abstract: Reinforcement learning (RL) has been shown to learn sophisticated control policies for complex tasks including games, robotics, heating and cooling systems and text generation. The action-perception cycle in RL, however, generally assumes that a measurement of the state of the environment is available at each time step without a cost. In applications such as materials design, deep-sea and planetary robot exploration and medicine, however, there can be a high cost associated with measuring, or even …

abstract arxiv control cooling cost cs.ai cs.lg dynamic environment games however learn measurement observation perception policies reinforcement reinforcement learning robotics state systems tasks text text generation the environment type

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US