Revisiting Discrete Soft Actor-Critic. (arXiv:2209.10081v2 [cs.LG] UPDATED) | allainews.com

Sept. 23, 2022, 1:12 a.m. | Haibin Zhou, Zichuan Lin, Junyou Li, Deheng Ye, Qiang Fu, Wei Yang

cs.LG updates on arXiv.org arxiv.org

We study the adaption of soft actor-critic (SAC) from continuous action space
to discrete action space. We revisit vanilla SAC and provide an in-depth
understanding of its Q value underestimation and performance instability issues
when applied to discrete settings. We thereby propose entropy-penalty and
double average Q-learning with Q-clip to address these issues. Extensive
experiments on typical benchmarks with discrete action space, including Atari
games and a large-scale MOBA game, show the efficacy of our proposed method.
Our code is …

actor-critic arxiv

More from arxiv.org / cs.LG updates on arXiv.org

REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback 9 hours ago | arxiv.org

abstract agents arxiv continuous +19

Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection 9 hours ago | arxiv.org

abstract annotations anomaly anomaly detection +21

Unraveling Batch Normalization for Realistic Test-Time Adaptation 9 hours ago | arxiv.org

arxiv cs.cv cs.lg normalization +2

The Effective Horizon Explains Deep RL Performance in Stochastic Environments 9 hours ago | arxiv.org

arxiv cs.ai cs.lg deep rl +6

FM-G-CAM: A Holistic Approach for Explainable AI in Computer Vision 9 hours ago | arxiv.org

abstract arxiv cnn computer +20

Generating Illustrated Instructions 9 hours ago | arxiv.org

abstract arxiv cs.ai cs.cv +11

Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement 9 hours ago | arxiv.org

arxiv cs.cv cs.lg dancing +6

SatCLIP: Global, General-Purpose Location Embeddings with Satellite Imagery 9 hours ago | arxiv.org

abstract arxiv challenge cs.ai +22

A precise symbolic emulator of the linear matter power spectrum 9 hours ago | arxiv.org

abstract applications arxiv astro-ph.co +15

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Associate Data Engineer

@ Redkite | London, England, United Kingdom

View on ai-jobs.net

Data Management Associate Consultant

@ SAP | Porto Salvo, PT, 2740-262

View on ai-jobs.net

NLP & Data Modelling Consultant - SAP LABS

@ SAP | Bengaluru, IN, 560066

View on ai-jobs.net

Catalog Data Quality Specialist

@ Delivery Hero | Montevideo, Uruguay

View on ai-jobs.net

Data Analyst for CEO Office with Pathway to Functional Analyst

@ Amar Bank | Jakarta

View on ai-jobs.net