Nov. 21, 2023, 5:51 p.m. | Ryan Pégoud

Towards Data Science - Medium towardsdatascience.com

Solving the CartPole environment with DQN in under a second

Photo by Thomas Despeyroux on Unsplash

Recent progress in Reinforcement Learning (RL), such as Waymo’s autonomous taxis or DeepMind’s superhuman chess-playing agents, complement classical RL with Deep Learning components such as Neural Networks and Gradient Optimization methods.

Building on the foundations and coding principles introduced in one of my previous stories, we’ll discover and learn to implement Deep Q-Networks (DQN) and replay buffers to solve OpenAI’s CartPole environment. …

deep learning getting-started jax machine learning reinforcement learning

Lecturer in Social Data Analytics

@ The University of Hong Kong | Hong Kong

Spatial Data Engineer

@ HERE Technologies | New Cairo, Egypt

Senior Cyber Software and Machine Learning Engineer

@ Draper | Cambridge, MA, United States

Senior Principal Software Engineer, Data Quality

@ Red Hat | Boston, United States

Principal Sotware Engineeer - Technical Data Architecture

@ Red Hat | Remote, Ireland

Data Management Associate

@ EcoVadis | Ebène, Mauritius