Jan. 24, 2022, 2:10 a.m. | Tongzhou Mu, Kaixiang Lin, Feiyang Niu, Govind Thattai

cs.LG updates on arXiv.org arxiv.org

We present a two-step hybrid reinforcement learning (RL) policy that is
designed to generate interpretable and robust hierarchical policies on the RL
problem with graph-based input. Unlike prior deep reinforcement learning
policies parameterized by an end-to-end black-box graph neural network, our
approach disentangles the decision-making process into two steps. The first
step is a simplified classification problem that maps the graph input to an
action group where all actions share a similar semantic meaning. The second
step implements a sophisticated …

arxiv graph graph-based hybrid learning policy reinforcement learning

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst

@ SEAKR Engineering | Englewood, CO, United States

Data Analyst II

@ Postman | Bengaluru, India

Data Architect

@ FORSEVEN | Warwick, GB

Director, Data Science

@ Visa | Washington, DC, United States

Senior Manager, Data Science - Emerging ML

@ Capital One | McLean, VA