Q-learning and Sarsa in grid environment for short-term vs long-term rewards

Jan. 11, 2022, 10:07 a.m. | /u/studentani

Artificial Intelligence www.reddit.com

I created my custom, grid(7 by 7) environment to apply RL algorithms. I chose Q-learning and Sarsa, in particular.

The grid environment consists of 3 types of terminating states: states with negative reward(-100), state with maximum reward(100) and 2 states with half reward(50).

The main goal of training is for the agent to avoid states with negative rewards and to prefer long-term reward(100) over short-term half reward(50).

The trained agent works weirdly when the half-rewarded state is closer to the …

artificial environment learning

Visit resource

More from www.reddit.com / Artificial Intelligence

Company Wants To Address Euro Teacher Shortage With AI By Using Avatars To Teach Maths 2 hours ago | www.reddit.com

artificial avatars maths shortage

Survey reveals translators and illustrators losing work to AI 4 hours ago | www.reddit.com

artificial compensation concerns consent +10

Apple releases eight small AI language models aimed at on-device use 4 hours ago | www.reddit.com

ai language models apple artificial code +15

How the First AI Project on CoinList in 2024 is Poised to Disrupt the AI … 14 hours ago | www.reddit.com

ai industry artificial disrupt industry +1

Udio strikes again! 19 hours ago | www.reddit.com

artificial strikes udio

Discussing the challenges of implementing generative AI in companies 21 hours ago | www.reddit.com

ai investments artificial challenges companies +9

Ars Technica article on Reddits new AI advertising bots 23 hours ago | www.reddit.com

artificial big good journalism +4

Visualizing AI Patents by Country 1 day, 9 hours ago | www.reddit.com

artificial country patents

One-Minute Daily AI News 4/24/2024 1 day, 10 hours ago | www.reddit.com

adobe ai news ai stack artificial +22

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior AI & Data Engineer

@ Bertelsmann | Kuala Lumpur, 14, MY, 50400

View on ai-jobs.net

Analytics Engineer

@ Reverse Tech | Philippines - Remote

View on ai-jobs.net

View more jobs

all AI news

Q-learning and Sarsa in grid environment for short-term vs long-term rewards

More from www.reddit.com / Artificial Intelligence

Jobs in AI, ML, Big Data

Data Architect

Data ETL Engineer

Lead GNSS Data Scientist

Senior Machine Learning Engineer (MLOps)

Senior AI & Data Engineer

Analytics Engineer