Oct. 21, 2022, 1:13 a.m. | Byungchan Ko, Jungseul Ok

cs.LG updates on arXiv.org

In deep reinforcement learning (RL), data augmentation is widely considered a
tool for inducing a set of useful priors about semantic consistency and for
improving sample efficiency and generalization performance. However, even when
the prior is useful for generalization, distilling it into the RL agent often
interferes with RL training and degrades sample efficiency. Meanwhile, the
agent tends to forget the prior due to the non-stationary nature of RL. These
observations suggest two extreme schedules of distillation: (i) over the entire …
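To make the idea of "distilling an augmentation-induced prior on a schedule" concrete, here is a minimal sketch. It is illustrative only and not the paper's method: the consistency loss (KL divergence between the policy on original and augmented observations), the noise-based augmentation, and the first-half-of-training schedule are all assumptions introduced for the example.

```python
# Sketch: scheduled distillation of an augmentation-consistency prior into an RL policy.
import torch
import torch.nn as nn
import torch.nn.functional as F

class Policy(nn.Module):
    def __init__(self, obs_dim: int, n_actions: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, 64), nn.ReLU(),
            nn.Linear(64, n_actions),
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        return self.net(obs)  # action logits

def augment(obs: torch.Tensor) -> torch.Tensor:
    # Placeholder semantic-preserving augmentation: small additive noise.
    return obs + 0.01 * torch.randn_like(obs)

def distill_weight(step: int, total_steps: int) -> float:
    # Hypothetical schedule: distill only during the first half of training.
    return 1.0 if step < total_steps // 2 else 0.0

def consistency_loss(policy: Policy, obs: torch.Tensor) -> torch.Tensor:
    # KL(pi(.|obs) || pi(.|augment(obs))): pushes the agent to act the same
    # on semantically equivalent observations.
    logp = F.log_softmax(policy(obs), dim=-1)
    logq = F.log_softmax(policy(augment(obs)), dim=-1)
    return F.kl_div(logq, logp, log_target=True, reduction="batchmean")

if __name__ == "__main__":
    policy = Policy(obs_dim=8, n_actions=4)
    opt = torch.optim.Adam(policy.parameters(), lr=3e-4)
    total_steps = 1000
    for step in range(total_steps):
        obs = torch.randn(32, 8)             # stand-in for a replay-buffer batch
        rl_loss = policy(obs).pow(2).mean()  # stand-in for the actual RL objective
        loss = rl_loss + distill_weight(step, total_steps) * consistency_loss(policy, obs)
        opt.zero_grad()
        loss.backward()
        opt.step()
```

The schedule coefficient is the knob the abstract is concerned with: setting it to a constant over all of training corresponds to one extreme, while confining it to an early phase (as in this sketch) corresponds to the other.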

arxiv augmentation data reinforcement reinforcement learning scheduling
