Multi agent deep q learning takes a dive at exploitation state [D] | allainews.com

March 1, 2024, 1:04 p.m. | /u/ripototo

Machine Learning www.reddit.com

I am using double & dueling deep q learning. shortly after reaching epsilon 0.01, the reward starts to go downhill. I am experimenting with different hyper parameters, but would be interested in any similar experiences/ideas.

My guess is that since it is a multi agent scenario, most of the exploration stage, the agents learn the best actions, given kind of random actions from the rest. once epsilon reaches 0.01, the behaviors of the rest of the agents (and thus the …

agent epsilon exploitation ideas machinelearning parameters state

More from www.reddit.com / Machine Learning

A Multi-Agent game where LLMs must trick each other as humans until one gets caught … 8 hours ago | www.reddit.com

agent fun game humans +7

[D] How reliable is RAG currently? 9 hours ago | www.reddit.com

context context window documents machinelearning +5

[N] New Challenges in DIAMBRA Arena: 3 epic additions to our lineup of RL environments! 9 hours ago | www.reddit.com

arena challenges environments epic +1

[R] An Analysis of Linear Time Series Forecasting Models 11 hours ago | www.reddit.com

abstract analysis forecasting form +9

[D] The "it" in AI models is really just the dataset? 12 hours ago | www.reddit.com

ai models dataset machinelearning

[D] Analysis of Time To First Token (TTFT) of LLMs (10B-34B) 14 hours ago | www.reddit.com

analysis containers docker hey +10

[P] Open Source / Projects Based Machine Learning Community? 17 hours ago | www.reddit.com

building collaborations community devs +16

[R] DDPM for Timeseries Generation 19 hours ago | www.reddit.com

column data data generation dataset +13

[P] [D] Examples of client projects that you have delivered 20 hours ago | www.reddit.com

client consulting examples freelance +6

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net