POLTER: Policy Trajectory Ensemble Regularization for Unsupervised Reinforcement Learning (arXiv:2205.11357v2 [cs.LG], updated)

Sept. 8, 2022, 1:11 a.m. | Frederik Schubert, Carolin Benjamins, Sebastian Döhler, Bodo Rosenhahn, Marius Lindauer

cs.LG updates on arXiv.org

The goal of Unsupervised Reinforcement Learning (URL) is to find a
reward-agnostic prior policy for a task domain that improves sample efficiency
on supervised downstream tasks. Although agents initialized with such a prior
policy can achieve significantly higher reward with fewer samples when
fine-tuned on a downstream task, how to obtain an optimal pretrained prior
policy in practice remains an open question. In this work,
we present POLTER (Policy Trajectory Ensemble Regularization) - a …
