Web: https://www.reddit.com/r/reinforcementlearning/comments/sd3ub2/combining_reward_functions_with_different_scales/

Jan. 26, 2022, 11:27 a.m. | /u/fedetask

Reinforcement Learning reddit.com

What are the best practices to use a reward function that is a combination of several types of rewards, that can have very different scales and meanings?

Take the example of a robot serving customers at a restaurant. I want it to maximize the number of dishes it serves during the day, but I also want to penalize it for making customers wait more than a certain amount of time. Note that without the penalization the robot might decide to …


Director, Data Engineering and Architecture

@ Chainalysis | California | New York | Washington DC | Remote - USA

Deep Learning Researcher

@ Topaz Labs | Dallas, TX

Sr Data Engineer (Contractor)

@ SADA | US - West

Senior Cloud Database Administrator

@ Findhelp | Remote

Senior Data Analyst

@ System1 | Remote

Speech Machine Learning Research Engineer

@ Samsung Research America | Mountain View, CA