Jan. 10, 2022, 4:14 a.m. | /u/ddcfefff

Machine Learning www.reddit.com

Is it better for the reward function to reward the agent for making a good move, or for being in a good state? E.g., should the reward for an agent in a good state that makes a net-negative move be higher than that of an agent in a bad state that makes the same net-negative move, or vice versa?

The most basic example I can think of is if you have an env with a “target” input of 1 or …
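One way to see the difference is a minimal sketch (all names hypothetical, assuming a one-bit state with target 1 as in the example above): a state-based reward scores where the agent lands, while a move-based reward scores the change, which is the shape of a potential-based shaping term.

```python
# Hypothetical toy setup: the state is a single bit, the "target" state is 1.
# Two candidate reward designs for the same transition s -> s'.

def state_reward(next_state, target=1):
    """Reward the state the agent ends up in: +1 in the target state, else 0."""
    return 1.0 if next_state == target else 0.0

def move_reward(state, next_state, target=1):
    """Reward the move itself as the change in a potential phi over states
    (phi(s') - phi(s)), so only improvement or regression is scored."""
    phi = lambda s: 1.0 if s == target else 0.0
    return phi(next_state) - phi(state)

# Under the state-based design, a net-negative move from a good state can
# still score no worse than the same move from a bad state; under the
# move-based design, only the change matters.
print(state_reward(1), state_reward(0))      # 1.0 0.0
print(move_reward(1, 0), move_reward(0, 0))  # -1.0 0.0
```

The `phi(s') - phi(s)` form is the classic potential-based shaping trick, which provably leaves the optimal policy unchanged, whereas a raw state reward can bias the agent toward lingering in good states.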

machinelearning rl
