DrS: Learning Reusable Dense Rewards for Multi-Stage Tasks

April 26, 2024, 4:42 a.m. | Tongzhou Mu, Minghua Liu, Hao Su

cs.LG updates on arXiv.org

arXiv:2404.16779v1
Abstract: The success of many RL techniques relies heavily on human-engineered dense rewards, which typically demand substantial domain expertise and extensive trial and error. In our work, we propose DrS (Dense reward learning from Stages), a novel approach for learning reusable dense rewards for multi-stage tasks in a data-driven manner. By leveraging the stage structures of the task, DrS learns a high-quality dense reward from sparse rewards and, if given, demonstrations. The learned rewards can be …
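
The abstract does not spell out the form of the learned reward, so the following is only a minimal sketch of how stage structure could shape a dense reward, not the authors' implementation. It assumes the task exposes an integer stage index and that some learned model supplies a real-valued score for progress within the current stage. The names `dense_reward`, `stage`, `score`, `num_stages`, and `alpha` are all hypothetical.

```python
import math


def dense_reward(stage: int, score: float, num_stages: int = 3,
                 alpha: float = 1.0 / 3.0) -> float:
    """Illustrative stage-shaped dense reward (hypothetical, not from the paper).

    stage:      index of the stage the agent is currently in (0-based);
                a value of num_stages means the whole task is complete.
    score:      unbounded within-stage progress score; in practice this
                would come from a learned model, here it is just an input.
    alpha:      scale of the within-stage term; keeping it below 0.5
                guarantees any later stage outranks any earlier one.
    """
    if not 0 <= stage <= num_stages:
        raise ValueError("stage must lie in [0, num_stages]")
    if not 0.0 < alpha < 0.5:
        raise ValueError("alpha must lie in (0, 0.5)")
    if stage == num_stages:
        # Task finished: return the maximum stage value with no shaping term.
        return float(num_stages)
    # The stage index gives coarse progress; the bounded tanh term adds
    # a dense signal inside the stage without crossing stage boundaries.
    return stage + alpha * math.tanh(score)


if __name__ == "__main__":
    for stage, score in [(0, -2.0), (0, 2.0), (1, -2.0), (3, 0.0)]:
        print(stage, score, round(dense_reward(stage, score), 3))
```

Bounding the within-stage term keeps the reward ordering consistent with the sparse stage signal: reaching a later stage is always worth more than any amount of progress inside an earlier one.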
