Jan. 31, 2024, 3:47 p.m. | Yunfan Zhao Nikhil Behari Edward Hughes Edwin Zhang Dheeraj Nagaraj Karl Tuyls Aparna Taneja M

cs.LG updates on arXiv.org arxiv.org

Restless multi-arm bandits (RMABs), a class of resource allocation problems with broad application in areas such as healthcare, online advertising, and anti-poaching, have recently been studied from a multi-agent reinforcement learning perspective. Prior RMAB research suffers from several limitations, e.g., it fails to adequately address continuous states, and requires retraining from scratch when arms opt-in and opt-out over time, a common challenge in many real world applications. We address these limitations by developing a neural network-based pre-trained model (PreFeRMAB) that …

advertising agent application arm class continuous cs.ai cs.lg healthcare limitations multi-agent online advertising perspective prior reinforcement reinforcement learning research retraining via

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US