May 19, 2022, 1:12 a.m. | Alessandro Ronca, Gabriel Paludo Licks, Giuseppe De Giacomo

cs.LG updates on arXiv.org arxiv.org

Our work aims at developing reinforcement learning algorithms that do not
rely on the Markov assumption. We consider the class of Non-Markov Decision
Processes where histories can be abstracted into a finite set of states while
preserving the dynamics. We call it a Markov abstraction since it induces a
Markov Decision Process over a set of states that encode the non-Markov
dynamics. This phenomenon underlies the recently introduced Regular Decision
Processes (as well as POMDPs where only a finite number …

arxiv decision learning markov processes reinforcement reinforcement learning

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Machine Learning Operations (MLOps) Engineer - Advisor

@ Peraton | Fort Lewis, WA, United States

Mid +/Senior Data Engineer (AWS/GCP)

@ Capco | Poland

Senior Software Engineer (ETL and Azure Databricks)|| RR/463/2024 || 4 - 7 Years

@ Emids | Bengaluru, India

Senior Data Scientist (H/F)

@ Business & Decision | Toulouse, France

Senior Analytics Engineer

@ Algolia | Paris, France