Online Learning with Unknown Constraints | allainews.com

March 8, 2024, 5:41 a.m. | Karthik Sridharan, Seung Won Wilson Yoo

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.04033v1 Announce Type: new
Abstract: We consider the problem of online learning where the sequence of actions played by the learner must adhere to an unknown safety constraint at every round. The goal is to minimize regret with respect to the best safe action in hindsight while simultaneously satisfying the safety constraint with high probability on each round. We provide a general meta-algorithm that leverages an online regression oracle to estimate the unknown safety constraint, and converts the predictions of …

abstract arxiv constraints cs.ai cs.lg every math.st online learning probability safety stat.ml stat.th type

More from arxiv.org / cs.LG updates on arXiv.org

APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference 6 hours ago | arxiv.org

abstract arxiv cs.cl cs.lg +15

Brain-Inspired Spiking Neural Networks for Industrial Fault Diagnosis: A Survey, Challenges, and Opportunities 6 hours ago | arxiv.org

abstract arxiv brain brain-inspired +21

Data-driven Energy Efficiency Modelling in Large-scale Networks: An Expert Knowledge and ML-based Approach 6 hours ago | arxiv.org

abstract arxiv challenge complexity +23

Learned Regularization for Inverse Problems: Insights from a Spectral Model 6 hours ago | arxiv.org

abstract art arxiv convergence +14

LLMs cannot find reasoning errors, but can correct them given the error location 6 hours ago | arxiv.org

abstract arxiv become chen +17

Conditional Denoising Diffusion Probabilistic Models for Data Reconstruction Enhancement in Wireless Communications 6 hours ago | arxiv.org

abstract arxiv channels communications +17

Deep ReLU networks and high-order finite element methods II: Chebyshev emulation 6 hours ago | arxiv.org

abstract arxiv continuous cs.lg +17

Robust Energy Consumption Prediction with a Missing Value-Resilient Metaheuristic-based Neural Network in Mobile App Development 6 hours ago | arxiv.org

abstract app application arxiv +21

On Universally Optimal Algorithms for A/B Testing 6 hours ago | arxiv.org

abstract a/b testing algorithm algorithms +17

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Senior Machine Learning Engineer

@ BlackStone eIT | Egypt - Remote

View on ai-jobs.net

Machine Learning Engineer - 2

@ Parspec | Bengaluru, India

View on ai-jobs.net