Web: http://arxiv.org/abs/2201.12126

Jan. 31, 2022, 2:11 a.m. | Niklas Höpner, Ilaria Tiddi, Herke van Hoof

cs.LG updates on arXiv.org arxiv.org

Enabling reinforcement learning (RL) agents to leverage a knowledge base
while learning from experience promises to advance RL in knowledge intensive
domains. However, it has proven difficult to leverage knowledge that is not
manually tailored to the environment. We propose to use the subclass
relationships present in open-source knowledge graphs to abstract away from
specific objects. We develop a residual policy gradient method that is able to
integrate knowledge across different abstraction levels in the class hierarchy.
Our method results …

ai arxiv gradient learning policy reinforcement learning

More from arxiv.org / cs.LG updates on arXiv.org

Senior Data Analyst

@ Fanatics Inc | Remote - New York

Data Engineer - Search

@ Cytora | United Kingdom - Remote

Product Manager, Technical - Data Infrastructure and Streaming

@ Nubank | Berlin

Postdoctoral Fellow: ML for autonomous materials discovery

@ Lawrence Berkeley National Lab | Berkeley, CA

Principal Data Scientist

@ Zuora | Remote

Data Engineer

@ Veeva Systems | Pennsylvania - Fort Washington