Web: https://www.reddit.com/r/reinforcementlearning/comments/s0ig21/qlearning_with_shortterm_vs_longterm_rewards/

Jan. 10, 2022, 11:59 a.m. | /u/studentani

Reinforcement Learning reddit.com

Hey guys, I have implemented and applied the Q-learning algorithm to the simple, grid environment. I defined terminating states, one with positive reward and others with negative rewards. And the training worked pretty well. Now, I wanted to enhance the process by adding the state with half reward,i.e., now there are 3 types of terminating states - a state with the biggest positive reward(100), a state with half of the reward(50), and the state with negative rewards(-100). As I said, they all terminate the process. However, when I test the …

learning reinforcementlearning

Statistics and Computer Science Specialist

@ Hawk-Research | Remote

Data Scientist, Credit/Fraud Strategy

@ Fora Financial | New York City

Postdoctoral Research Associate - Biomedical Natural Language Processing and Deep Learning

@ Oak Ridge National Laboratory - Oak Ridge, TN | Oak Ridge, TN, United States

Senior Machine Learning / Computer Vision Engineer

@ Glass Imaging | Los Altos, CA

Research Scientist in Biomedical Natural Language Processing and Deep Learning

@ Oak Ridge National Laboratory | Oak Ridge, TN

W3-Professorship for Intelligent Energy Management

@ Universität Bayreuth | Bayreuth, Germany