all AI news
Meta Learning MDPs with Linear Transition Models. (arXiv:2201.08732v1 [cs.LG])
Jan. 24, 2022, 2:10 a.m. | Robert Müller, Aldo Pacchiano
cs.LG updates on arXiv.org arxiv.org
We study meta-learning in Markov Decision Processes (MDP) with linear
transition models in the undiscounted episodic setting. Under a task sharedness
metric based on model proximity we study task families characterized by a
distribution over models specified by a bias term and a variance component. We
then propose BUC-MatrixRL, a version of the UC-Matrix RL algorithm, and show it
can meaningfully leverage a set of sampled training tasks to quickly solve a
test task sampled from the same task distribution …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Data Strategy & Management - Private Equity Sector - Manager - Consulting - Location OPEN
@ EY | New York City, US, 10001-8604
Data Engineer- People Analytics
@ Volvo Group | Gothenburg, SE, 40531