Accounting for the Sequential Nature of States to Learn Features for Reinforcement Learning. (arXiv:2205.06000v1 [cs.LG]) | allainews.com

May 13, 2022, 1:11 a.m. | Nathan Michlo, Devon Jarvis, Richard Klein, Steven James

cs.LG updates on arXiv.org arxiv.org

In this work, we investigate the properties of data that cause popular
representation learning approaches to fail. In particular, we find that in
environments where states do not significantly overlap, variational
autoencoders (VAEs) fail to learn useful features. We demonstrate this failure
in a simple gridworld domain, and then provide a solution in the form of metric
learning. However, metric learning requires supervision in the form of a
distance function, which is absent in reinforcement learning. To overcome this,
we …

accounting arxiv features learning reinforcement reinforcement learning

More from arxiv.org / cs.LG updates on arXiv.org

REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback 10 hours ago | arxiv.org

abstract agents arxiv continuous +19

Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection 10 hours ago | arxiv.org

abstract annotations anomaly anomaly detection +21

Unraveling Batch Normalization for Realistic Test-Time Adaptation 10 hours ago | arxiv.org

arxiv cs.cv cs.lg normalization +2

The Effective Horizon Explains Deep RL Performance in Stochastic Environments 10 hours ago | arxiv.org

arxiv cs.ai cs.lg deep rl +6

FM-G-CAM: A Holistic Approach for Explainable AI in Computer Vision 10 hours ago | arxiv.org

abstract arxiv cnn computer +20

Generating Illustrated Instructions 10 hours ago | arxiv.org

abstract arxiv cs.ai cs.cv +11

Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement 10 hours ago | arxiv.org

arxiv cs.cv cs.lg dancing +6

SatCLIP: Global, General-Purpose Location Embeddings with Satellite Imagery 10 hours ago | arxiv.org

abstract arxiv challenge cs.ai +22

A precise symbolic emulator of the linear matter power spectrum 10 hours ago | arxiv.org

abstract applications arxiv astro-ph.co +15

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Associate Data Engineer

@ Redkite | London, England, United Kingdom

View on ai-jobs.net

Data Management Associate Consultant

@ SAP | Porto Salvo, PT, 2740-262

View on ai-jobs.net

NLP & Data Modelling Consultant - SAP LABS

@ SAP | Bengaluru, IN, 560066

View on ai-jobs.net

Catalog Data Quality Specialist

@ Delivery Hero | Montevideo, Uruguay

View on ai-jobs.net

Data Analyst for CEO Office with Pathway to Functional Analyst

@ Amar Bank | Jakarta

View on ai-jobs.net