May 9, 2024, 4:42 a.m. | Zhong Zheng, Fengyu Gao, Lingzhou Xue, Jing Yang

cs.LG updates on arXiv.org

arXiv:2312.15023v2 Announce Type: replace
Abstract: In this paper, we consider federated reinforcement learning for tabular episodic Markov Decision Processes (MDPs) where, under the coordination of a central server, multiple agents collaboratively explore the environment and learn an optimal policy without sharing their raw data. While a linear speedup in the number of agents has been achieved in similar settings for some metrics, such as convergence rate and sample complexity, it is unclear whether it is possible to design a model-free algorithm …
