Learning Explainable and Better Performing Representations of POMDP Strategies | allainews.com

May 22, 2024, 4:43 a.m. | Alexander Bork, Debraj Chakraborty, Kush Grover, Jan Kretinsky, Stefanie Mohr

cs.LG updates on arXiv.org arxiv.org

arXiv:2401.07656v3 Announce Type: replace-cross
Abstract: Strategies for partially observable Markov decision processes (POMDP) typically require memory. One way to represent this memory is via automata. We present a method to learn an automaton representation of a strategy using a modification of the L*-algorithm. Compared to the tabular representation of a strategy, the resulting automaton is dramatically smaller and thus also more explainable. Moreover, in the learning process, our heuristics may even improve the strategy's performance. In contrast to approaches that …

abstract algorithm arxiv automaton cs.ai cs.lg cs.lo decision learn markov memory observable processes replace representation strategies strategy tabular type via

More from arxiv.org / cs.LG updates on arXiv.org

Consistent3D: Towards Consistent High-Fidelity Text-to-3D Generation with Deterministic Sampling Prior 1 day, 18 hours ago | arxiv.org

arxiv consistent cs.cv cs.lg +6

Machine-learned models for magnetic materials 1 day, 18 hours ago | arxiv.org

abstract arxiv autoencoder cond-mat.mtrl-sci +17

Revisiting RIP guarantees for sketching operators on mixture models 1 day, 18 hours ago | arxiv.org

abstract alternative analysis arxiv +9

Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata 1 day, 18 hours ago | arxiv.org

abstract accuracy arxiv assessment +16

Getting More for Less: Using Weak Labels and AV-Mixup for Robust Audio-Visual Speaker Verification 1 day, 18 hours ago | arxiv.org

abstract arxiv audio cs.cv +18

Neural-network quantum state study of the long-range antiferromagnetic Ising chain 1 day, 18 hours ago | arxiv.org

abstract arxiv boltzmann cond-mat.quant-gas +12

Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on … 1 day, 18 hours ago | arxiv.org

abstract arxiv assumptions cs.lg +22

Vortex Feature Positioning: Bridging Tabular IIoT Data and Image-Based Deep Learning 1 day, 18 hours ago | arxiv.org

abstract arxiv cs.cv cs.lg +19

Provably Efficient Exploration in Quantum Reinforcement Learning with Logarithmic Worst-Case Regret 1 day, 18 hours ago | arxiv.org

abstract algorithms arxiv attention +20

Senior Data Engineer

@ Displate | Warsaw

View on ai-jobs.net

Analyst, Data Analytics

@ T. Rowe Price | Owings Mills, MD - Building 4

View on ai-jobs.net

Regulatory Data Analyst

@ Federal Reserve System | San Francisco, CA

View on ai-jobs.net

Sr. Data Analyst

@ Bank of America | Charlotte

View on ai-jobs.net

Data Analyst- Tech Refresh

@ CACI International Inc | 1J5 WASHINGTON DC (BOLLING AFB)

View on ai-jobs.net

Senior AML/CFT & Data Analyst

@ Ocorian | Ebène, Mauritius

View on ai-jobs.net