May 13, 2022, 1:11 a.m. | Nathan Michlo, Devon Jarvis, Richard Klein, Steven James

cs.LG updates on arXiv.org arxiv.org

In this work, we investigate the properties of data that cause popular
representation learning approaches to fail. In particular, we find that in
environments where states do not significantly overlap, variational
autoencoders (VAEs) fail to learn useful features. We demonstrate this failure
in a simple gridworld domain, and then provide a solution in the form of metric
learning. However, metric learning requires supervision in the form of a
distance function, which is absent in reinforcement learning. To overcome this,
we …

accounting arxiv features learning reinforcement reinforcement learning

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Associate Data Engineer

@ Redkite | London, England, United Kingdom

Data Management Associate Consultant

@ SAP | Porto Salvo, PT, 2740-262

NLP & Data Modelling Consultant - SAP LABS

@ SAP | Bengaluru, IN, 560066

Catalog Data Quality Specialist

@ Delivery Hero | Montevideo, Uruguay

Data Analyst for CEO Office with Pathway to Functional Analyst

@ Amar Bank | Jakarta