Learning for Bandits under Action Erasures | allainews.com

June 27, 2024, 4:45 a.m. | Osama Hanna, Merve Karakas, Lin F. Yang, Christina Fragouli

cs.LG updates on arXiv.org arxiv.org

arXiv:2406.18072v1 Announce Type: cross
Abstract: We consider a novel multi-arm bandit (MAB) setup, where a learner needs to communicate the actions to distributed agents over erasure channels, while the rewards for the actions are directly available to the learner through external sensors. In our model, while the distributed agents know if an action is erased, the central learner does not (there is no feedback), and thus does not know whether the observed reward resulted from the desired action or not. …

abstract action agents arm arxiv channels cs.lg distributed multi novel sensors setup stat.ml through type while

More from arxiv.org / cs.LG updates on arXiv.org

Bayesian identification of nonseparable Hamiltonians with multiplicative noise using deep learning and reduced-order modeling 1 day, 16 hours ago | arxiv.org

abstract arxiv bayesian cs.lg +17

MMGPL: Multimodal Medical Data Analysis with Graph Prompt Learning 1 day, 16 hours ago | arxiv.org

abstract analysis arxiv cs.cv +16

Self-Supervised Detection of Perfect and Partial Input-Dependent Symmetries 1 day, 16 hours ago | arxiv.org

arxiv cs.cv cs.lg detection +3

MixerFlow: MLP-Mixer meets Normalising Flows 1 day, 16 hours ago | arxiv.org

abstract architectures arxiv context +15

Machine Learning-Enabled Software and System Architecture Frameworks 1 day, 16 hours ago | arxiv.org

abstract architecture arxiv concerns +22

Efficient Interaction-Aware Interval Analysis of Neural Network Feedback Loops 1 day, 16 hours ago | arxiv.org

abstract analysis arxiv cs.lg +19

Kernelised Normalising Flows 1 day, 16 hours ago | arxiv.org

abstract architecture arxiv capabilities +14

GSplit: Scaling Graph Neural Network Training on Large Graphs via Split-Parallelism 1 day, 16 hours ago | arxiv.org

abstract arxiv class cs.dc +25

Reinforcement Learning in Credit Scoring and Underwriting 1 day, 16 hours ago | arxiv.org

abstract action adapt arxiv +17

Quantitative Researcher – Algorithmic Research

@ Man Group | GB London Riverbank House

View on ai-jobs.net

Software Engineering Expert

@ Sanofi | Budapest

View on ai-jobs.net

Senior Bioinformatics Scientist

@ Illumina | US - Bay Area - Foster City

View on ai-jobs.net

Senior Engineer - Generative AI Product Engineering (Remote-Eligible)

@ Capital One | McLean, VA

View on ai-jobs.net

Graduate Assistant - Bioinformatics

@ University of Arkansas System | University of Arkansas at Little Rock

View on ai-jobs.net

Senior AI-HPC Cluster Engineer

@ NVIDIA | US, CA, Santa Clara

View on ai-jobs.net