Multi-Stage Episodic Control for Strategic Exploration in Text Games. (arXiv:2201.01251v1 [cs.CL]) | allainews.com

Jan. 5, 2022, 2:10 a.m. | Jens Tuyls, Shunyu Yao, Sham Kakade, Karthik Narasimhan

cs.CL updates on arXiv.org arxiv.org

Text adventure games present unique challenges to reinforcement learning
methods due to their combinatorially large action spaces and sparse rewards.
The interplay of these two factors is particularly demanding because large
action spaces require extensive exploration, while sparse rewards provide
limited feedback. This work proposes to tackle the explore-vs-exploit dilemma
using a multi-stage approach that explicitly disentangles these two strategies
within each episode. Our algorithm, called eXploit-Then-eXplore (XTX), begins
each episode using an exploitation policy that imitates a set of …

arxiv exploration games stage text

More from arxiv.org / cs.CL updates on arXiv.org

A Text Classification Framework for Simple and Effective Early Depression Detection Over Social Media Streams 3 hours ago | arxiv.org

abstract arxiv build classification +22

A Survey on Prompting Techniques in LLMs 3 hours ago | arxiv.org

abstract arxiv autoregressive cs.ai +24

Enabling On-Device Large Language Model Personalization with Self-Supervised Data Selection and Synthesis 3 hours ago | arxiv.org

abstract arxiv conversation cs.cl +21

ML-Bench: Evaluating Large Language Models for Code Generation in Repository-Level Machine Learning Tasks 3 hours ago | arxiv.org

arxiv code code generation cs.ai +9

Strings from the Library of Babel: Random Sampling as a Strong Baseline for Prompt Optimisation 3 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +16

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 3 hours ago | arxiv.org

abstract arxiv case case study +19

Formal Aspects of Language Modeling 3 hours ago | arxiv.org

abstract artificial artificial intelligence arxiv +24

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model 3 hours ago | arxiv.org

abstract advanced arxiv challenges +24

Predicting Emergent Abilities with Infinite Resolution Evaluation 3 hours ago | arxiv.org

abstract arxiv cs.cl evaluation +19

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Data Science Sustainability Co-Op (Summer & Fall 2024)

@ O-I | Perrysburg, OH, United States

View on ai-jobs.net

Research Scientist

@ Chevron Phillips Chemical Company | USA: Kingwood, TX, US, 77339

View on ai-jobs.net

Data Scientist Python (Django) (m/f/d)

@ RoomPriceGenie | Hybrid Mannheim, Remote DACH, Remote Germany

View on ai-jobs.net

Operational Transformation & Strategy - Data Operations - Associate

@ JPMorgan Chase & Co. | Mumbai, Maharashtra, India

View on ai-jobs.net

Senior Data Scientist

@ Rocket Travel | Chicago, IL USA

View on ai-jobs.net