Web: https://www.reddit.com/r/reinforcementlearning/comments/serz47/environment_for_nonepisodic_continuing_tasks/

Jan. 28, 2022, 2:24 p.m. | /u/Acrobatic-Ad-9189

Reinforcement Learning reddit.com

Hi! I am going to write my master's thesis about using PG /actor critic methods for process optimization. As a case study I think it would be fun to have some visual game-like enviroment to run the algorithms on, as a comparison.

I was therefore looking everywhere to find environment simulations with continuous action spaces that are non-episodic, to on-line learn how to optimize the policies. I thought the Pendulum environment was quite fitting, or the mountaincar-continuous. But these …

environment reinforcementlearning

Senior Data Engineer

@ DAZN | Hammersmith, London, United Kingdom

Sr. Data Engineer, Growth

@ Netflix | Remote, United States

Data Engineer - Remote

@ Craft | Wrocław, Lower Silesian Voivodeship, Poland

Manager, Operations Data Science

@ Binance.US | Vancouver

Senior Machine Learning Researcher for Copilot

@ GitHub | Remote - Europe

Sr. Marketing Data Analyst

@ HoneyBook | San Francisco, CA