all AI news
Exploiting Expert-guided Symmetry Detection in Offline Reinforcement Learning. (arXiv:2112.09943v2 [cs.LG] UPDATED)
cs.LG updates on arXiv.org arxiv.org
Offline estimation of the dynamical model of a Markov Decision Process (MDP)
is a non-trivial task that greatly depends on the data available to the
learning phase. Sometimes the dynamics of the model is invariant with respect
to some transformations of the current state and action. Recent works showed
that an expert-guided pipeline relying on Density Estimation methods as Deep
Neural Network based Normalizing Flows effectively detects this structure in
deterministic environments, both categorical and continuous-valued. The
acquired knowledge can …
arxiv detection expert learning reinforcement reinforcement learning symmetry