Safe Exploration Incurs Nearly No Additional Sample Complexity for Reward-free RL. (arXiv:2206.14057v1 [cs.LG]) | allainews.com

June 29, 2022, 1:11 a.m. | Ruiquan Huang, Jing Yang, Yingbin Liang

stat.ML updates on arXiv.org arxiv.org

While the primary goal of the exploration phase in reward-free reinforcement
learning (RF-RL) is to reduce the uncertainty in the estimated model with the
minimum number of trajectories, in practice the agent often needs to abide by
certain safety constraints at the same time. It remains unclear how such a safe
exploration requirement would affect the corresponding sample complexity to
achieve the desired optimality of the obtained policy in planning. In this
work, we make a first attempt to answer this question. …

