all AI news
Brick Tic-Tac-Toe: Exploring the Generalizability of AlphaZero to Novel Test Environments. (arXiv:2207.05991v2 [cs.LG] UPDATED)
July 15, 2022, 1:11 a.m. | John Tan Chong Min, Mehul Motani
cs.LG updates on arXiv.org arxiv.org
Traditional reinforcement learning (RL) environments typically are the same
for both the training and testing phases. Hence, current RL methods are largely
not generalizable to a test environment which is conceptually similar but
different from what the method has been trained on, which we term the novel
test environment. As an effort to push RL research towards algorithms which can
generalize to novel test environments, we introduce the Brick Tic-Tac-Toe
(BTTT) test bed, where the brick position in the test …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
C003549 Data Analyst (NS) - MON 13 May
@ EMW, Inc. | Braine-l'Alleud, Wallonia, Belgium
Marketing Decision Scientist
@ Meta | Menlo Park, CA | New York City