Learning Explainable and Better Performing Representations of POMDP Strategies | allainews.com

May 22, 2024, 4:43 a.m. | Alexander Bork, Debraj Chakraborty, Kush Grover, Jan Kretinsky, Stefanie Mohr

cs.LG updates on arXiv.org arxiv.org

arXiv:2401.07656v3 Announce Type: replace-cross
Abstract: Strategies for partially observable Markov decision processes (POMDP) typically require memory. One way to represent this memory is via automata. We present a method to learn an automaton representation of a strategy using a modification of the L*-algorithm. Compared to the tabular representation of a strategy, the resulting automaton is dramatically smaller and thus also more explainable. Moreover, in the learning process, our heuristics may even improve the strategy's performance. In contrast to approaches that …

abstract algorithm arxiv automaton cs.ai cs.lg cs.lo decision learn markov memory observable processes replace representation strategies strategy tabular type via

More from arxiv.org / cs.LG updates on arXiv.org

Lessons on Datasets and Paradigms in Machine Learning for Symbolic Computation: A Case Study on … 1 day, 5 hours ago | arxiv.org

abstract algebra algorithms arxiv +20

Learning to Maximize Gains From Trade in Small Markets 1 day, 5 hours ago | arxiv.org

abstract arxiv balance budget +18

Predicting and Interpreting Energy Barriers of Metallic Glasses with Graph Neural Networks 1 day, 5 hours ago | arxiv.org

abstract arxiv challenge cond-mat.dis-nn +20

Towards Enhancing the Reproducibility of Deep Learning Bugs: An Empirical Study 1 day, 5 hours ago | arxiv.org

abstract arxiv autonomous autonomous vehicles +19

GLIMPSE: Generalized Local Imaging with MLPs 1 day, 5 hours ago | arxiv.org

abstract art arxiv cnn +22

WWW: What, When, Where to Compute-in-Memory 1 day, 5 hours ago | arxiv.org

abstract architecture arxiv compute +20

Signatures Meet Dynamic Programming: Generalizing Bellman Equations for Trajectory Following 1 day, 5 hours ago | arxiv.org

abstract arxiv cs.lg cs.ro +16

Low latency optical-based mode tracking with machine learning deployed on FPGAs on a tokamak 1 day, 5 hours ago | arxiv.org

abstract applications arxiv cameras +26

Measuring and Mitigating Biases in Motor Insurance Pricing 1 day, 5 hours ago | arxiv.org

abstract arxiv biases construct +17

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

Senior Robotics Engineer - Applications

@ Vention | Montréal, QC, Canada

View on ai-jobs.net

Senior Application Security Engineer, SHINE - Security Hub for Innovation and Efficiency

@ Amazon.com | Toronto, Ontario, CAN

View on ai-jobs.net

Simulation Scientist , WWDE Simulation

@ Amazon.com | Bellevue, Washington, USA

View on ai-jobs.net

Giáo Viên Steam

@ Việc Làm Giáo Dục | Da Nang, Da Nang, Vietnam

View on ai-jobs.net

Senior Simulation Developer

@ Vention | Montréal, QC, Canada

View on ai-jobs.net