Web: https://www.reddit.com/r/reinforcementlearning/comments/sboerj/continuing_task_broken_into_episodes/

Jan. 24, 2022, 3:14 p.m. | /u/fedetask

Reinforcement Learning reddit.com

I want to train an RL agent on a continuing task (there is no start and end), but I can only simulate a fixed number of steps. Therefore, I need to train the agent by simulating several trajectory segments.

Now, in the usual episodic task, I would learn the value function using the target y_t = r_t + gamma * V(s_{t+1}) and, for the last step of the episode, y_T = r_T.
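To make the two episodic targets concrete, here is a minimal sketch in plain Python. The function name `td_targets`, the `terminal` flag list, and the numbers are all made up for illustration; they are not from any particular library or from my actual setup.

```python
def td_targets(rewards, next_values, terminal, gamma=0.99):
    """One-step TD targets.

    y_t = r_t + gamma * V(s_{t+1}) on ordinary steps,
    y_T = r_T on steps flagged as terminal (no bootstrapping).
    """
    targets = []
    for r, v_next, done in zip(rewards, next_values, terminal):
        # On a terminal step the value of the next state is treated as zero.
        bootstrap = 0.0 if done else gamma * v_next
        targets.append(r + bootstrap)
    return targets


# Hypothetical three-step episode; next_values stands in for V(s_{t+1}).
rewards = [1.0, 0.0, 2.0]
next_values = [0.5, 1.0, 3.0]
terminal = [False, False, True]

targets = td_targets(rewards, next_values, terminal, gamma=0.9)
```

Only the step flagged in `terminal` drops the bootstrap term; every other step uses the estimated value of the next state.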

However, in my case, there is no "last …

