Feb. 8, 2024, 5:43 a.m. | Kalle Kujanp\"a\"a Amin Babadi Yi Zhao Juho Kannala Alexander Ilin Joni Pajarinen

cs.LG updates on arXiv.org arxiv.org

Online planning is crucial for high performance in many complex sequential decision-making tasks. Monte Carlo Tree Search (MCTS) employs a principled mechanism for trading off exploration for exploitation for efficient online planning, and it outperforms comparison methods in many discrete decision-making domains such as Go, Chess, and Shogi. Subsequently, extensions of MCTS to continuous domains have been developed. However, the inherent high branching factor and the resulting explosion of the search tree size are limiting the existing methods. To address …

chess comparison continuous cs.ai cs.lg cs.ro decision domains exploitation exploration extensions graph making performance planning search tasks trading tree

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne