No-Regret Reinforcement Learning in Smooth MDPs | allainews.com

Feb. 7, 2024, 5:42 a.m. | Davide Maran Alberto Maria Metelli Matteo Papini Marcello Restell

cs.LG updates on arXiv.org arxiv.org

Obtaining no-regret guarantees for reinforcement learning (RL) in the case of problems with continuous state and/or action spaces is still one of the major open challenges in the field. Recently, a variety of solutions have been proposed, but besides very specific settings, the general problem remains unsolved. In this paper, we introduce a novel structural assumption on the Markov decision processes (MDPs), namely $\nu-$smoothness, that generalizes most of the settings proposed so far (e.g., linear MDPs and Lipschitz MDPs). To …

case challenges continuous cs.ai cs.lg general major novel paper reinforcement reinforcement learning solutions spaces state unsolved

More from arxiv.org / cs.LG updates on arXiv.org

Challenging the Human-in-the-loop in Algorithmic Decision-making now | arxiv.org

Off-the-Shelf Neural Network Architectures for Forex Time Series Prediction come at a Cost a second ago | arxiv.org

abstract analyze ann architecture +21

Cost-Effective Fault Tolerance for CNNs Using Parameter Vulnerability Based Hardening and Pruning a second ago | arxiv.org

abstract applications arxiv become +17

Cyclical Weight Consolidation: Towards Solving Catastrophic Forgetting in Serial Federated Learning 2 seconds ago | arxiv.org

abstract algorithms arxiv attention +19

Hi-GMAE: Hierarchical Graph Masked Autoencoders 3 seconds ago | arxiv.org

abstract arxiv autoencoders cs.lg +17

Harnessing Collective Structure Knowledge in Data Augmentation for Graph Neural Networks 4 seconds ago | arxiv.org

abstract art arxiv augmentation +23

Sample-Efficient Constrained Reinforcement Learning with General Parameterization 4 seconds ago | arxiv.org

abstract agent arxiv building +14

Historically Relevant Event Structuring for Temporal Knowledge Graph Reasoning 5 seconds ago | arxiv.org

abstract arxiv correlations cs.ai +19

Distributed Event-Based Learning via ADMM 6 seconds ago | arxiv.org

abstract agents arxiv communication +15

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net