Web: https://www.reddit.com/r/reinforcementlearning/comments/sboerj/continuing_task_broken_into_episodes/

Jan. 24, 2022, 3:14 p.m. | /u/fedetask

Reinforcement Learning reddit.com

I want to train an RL agent on a continuing task (there is no start or end), but I can only simulate a fixed number of steps. Therefore, I need to train the agent by simulating several pieces of trajectories.

Now, in the usual episodic setting, I would learn the value function using the target y_t = r_t + gamma * V(s_{t+1}) and, for the last step of the episode, y_T = r_T.
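For concreteness, here is a minimal sketch of those episodic targets in Python with NumPy. The helper name `td_targets`, the `terminal` mask, and the numbers are assumptions made up for illustration; they are not taken from any particular library.

```python
import numpy as np

def td_targets(rewards, next_values, terminal, gamma=0.99):
    """One-step TD targets for the episodic case (hypothetical helper).

    y_t = r_t + gamma * V(s_{t+1}) for ordinary steps,
    y_T = r_T                      where terminal[t] is True.
    """
    rewards = np.asarray(rewards, dtype=float)
    next_values = np.asarray(next_values, dtype=float)
    terminal = np.asarray(terminal, dtype=bool)
    # Multiplying by (~terminal) zeroes the bootstrap term on the last step.
    return rewards + gamma * next_values * (~terminal)

# Made-up three-step episode; the last step is terminal.
targets = td_targets(
    rewards=[1.0, 0.0, 2.0],
    next_values=[0.5, 1.0, 3.0],  # V(s_{t+1}) estimates, illustrative only
    terminal=[False, False, True],
    gamma=0.9,
)
print(targets)  # [1.45 0.9  2.  ]
```

The mask is the only place where the end of the episode enters the target, so it is the part to look at when the notion of a "last step" changes.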

However, in my case, there is no "last …


