Oct. 20, 2022, 1:11 a.m. | Abdus Salam Azad, Izzeddin Gur, Aleksandra Faust, Pieter Abbeel, Ion Stoica

cs.LG updates on arXiv.org arxiv.org

Reinforcement Learning (RL) algorithms are often known for sample
inefficiency and difficult generalization. Recently, Unsupervised Environment
Design (UED) emerged as a new paradigm for zero-shot generalization by
simultaneously learning a task distribution and agent policies on the sampled
tasks. This is a non-stationary process where the task distribution evolves
along with agent policies, creating an instability over time. While past works
demonstrated the potential of such approaches, sampling effectively from the
task space remains an open challenge, bottlenecking these approaches. …

arxiv curriculum curriculum learning representation representation learning unsupervised

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Sr. Software Development Manager, AWS Neuron Machine Learning Distributed Training

@ Amazon.com | Cupertino, California, USA