https://www.reddit.com/r/reinforcementlearning/comments/sd3ub2/combining_reward_functions_with_different_scales/

Jan. 26, 2022, 11:27 a.m. | /u/fedetask

Reinforcement Learning reddit.com

What are the best practices for using a reward function that combines several types of reward, which can have very different scales and meanings?

Take the example of a robot serving customers at a restaurant. I want it to maximize the number of dishes it serves during the day, but I also want to penalize it for making customers wait longer than a certain amount of time. Note that without the penalty the robot might decide to …
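To make the setup concrete, here is a minimal sketch of how the two terms in the restaurant example could be combined into one per-step reward. It only illustrates the problem; it is not a recommended solution. Everything in it is an assumption for illustration: the names `RewardConfig` and `step_reward`, the weighted-sum form, and the numeric defaults do not come from the post.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class RewardConfig:
    # All values are hypothetical; the post does not specify any of them.
    serve_reward: float = 1.0     # reward per dish served in this step
    wait_threshold: float = 10.0  # minutes a customer may wait without penalty
    wait_penalty: float = 0.5     # penalty per minute waited beyond the threshold


def step_reward(dishes_served: int,
                wait_times: Sequence[float],
                cfg: RewardConfig) -> float:
    """Weighted sum of a serving term and a waiting-time penalty term."""
    # Term 1: counts dishes, so it grows in integer steps.
    serving_term = cfg.serve_reward * dishes_served
    # Term 2: measured in minutes, so its raw scale is unrelated to term 1.
    overdue_minutes = sum(max(0.0, w - cfg.wait_threshold) for w in wait_times)
    penalty_term = cfg.wait_penalty * overdue_minutes
    return serving_term - penalty_term


if __name__ == "__main__":
    cfg = RewardConfig()
    # Two dishes served; customers have waited 5, 12 and 20 minutes.
    print(step_reward(2, [5.0, 12.0, 20.0], cfg))
```

With these made-up numbers, a single customer who has waited 20 minutes outweighs both dishes served, which is exactly the scale mismatch the question is about: the relative size of `serve_reward` and `wait_penalty` decides which behaviour the agent ends up preferring.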


