all AI news
Conservative Q Learning TD error not converging
Web: https://www.reddit.com/r/reinforcementlearning/comments/s87v2x/conservative_q_learning_td_error_not_converging/
Jan. 20, 2022, 2:36 a.m. | /u/superkaiba
Reinforcement Learning reddit.com
Hi, I am using the discrete conservative Q learning implementation in the d3rlpy library (https://github.com/takuseno/d3rlpy) to train a policy offline to optimize mechanical ventilation treatment by using the MIMIC-III dataset (https://physionet.org/content/mimiciii-demo/1.4/).
The state space for my problem is a set of 38 measurements taken from the MIMIC-III dataset such as heartrate, blood pressure, etc.
The action space is a combination of 3 settings (Positive end-expiratory pressure, fraction of inspired oxygen and adjusted tidal volume) on the …
!-->More from reddit.com / Reinforcement Learning
Offered a position as an RL Engineer - Seeking Advice
1 day, 5 hours ago |
reddit.com
Help with MADDPG on Food Collector (Unity ML-Agents)
2 days, 8 hours ago |
reddit.com
Latest AI/ML/Big Data Jobs
Research Scientist, 3D Reconstruction
@ Yembo | Remote, US
Clinical Assistant or Associate Professor of Management Science and Systems
@ University at Buffalo | Buffalo, NY
Data Analyst
@ Colorado Springs Police Department | Colorado Springs, CO
Predictive Ecology Postdoctoral Fellow
@ Lawrence Berkeley National Lab | Berkeley, CA
Data Analyst, Patagonia Action Works
@ Patagonia | Remote
Data & Insights Strategy & Innovation General Manager
@ Chevron Services Company, a division of Chevron U.S.A Inc. | Houston, TX