Aug. 22, 2022, 1:10 a.m. | Jared Markowitz, Ryan W. Gardner, Ashley Llorens, Raman Arora, I-Jeng Wang

cs.LG updates on arXiv.org

Standard deep reinforcement learning (DRL) aims to maximize expected reward,
considering collected experiences equally in formulating a policy. This differs
from human decision-making, where gains and losses are valued differently and
outlying outcomes are given increased consideration. It also fails to
capitalize on opportunities to improve safety and/or performance through the
incorporation of distributional context. Several approaches to distributional
DRL have been investigated, with one popular strategy being to evaluate the
projected distribution of returns for possible actions. We propose …
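To make the "projected distribution of returns" strategy concrete, here is a minimal sketch of how a distributional critic's quantile estimates can drive action selection under either a risk-neutral (mean) or risk-averse (CVaR) criterion. The quantile values, the three actions, and the CVaR level are all illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Hypothetical per-action quantile estimates of return, as a
# quantile-based distributional critic (e.g., QR-DQN-style) would
# produce. Values are synthetic, for illustration only.
rng = np.random.default_rng(0)
n_quantiles = 51
quantiles = {
    a: np.sort(rng.normal(loc=mu, scale=sigma, size=n_quantiles))
    for a, (mu, sigma) in enumerate([(1.0, 0.2), (1.2, 1.5), (0.8, 0.1)])
}

def expected_return(q):
    # Risk-neutral criterion: mean over the quantile estimates.
    return q.mean()

def cvar(q, alpha=0.2):
    # Risk-averse criterion: mean of the worst alpha-fraction
    # of outcomes (quantiles are sorted ascending).
    k = max(1, int(np.ceil(alpha * len(q))))
    return q[:k].mean()

# The same distributional estimates support different policies
# depending on which statistic of the return distribution is used.
greedy_action = max(quantiles, key=lambda a: expected_return(quantiles[a]))
risk_averse_action = max(quantiles, key=lambda a: cvar(quantiles[a]))

print("risk-neutral choice:", greedy_action)
print("risk-averse (CVaR) choice:", risk_averse_action)
```

A standard expected-reward agent collapses each distribution to its mean before acting; retaining the full distribution, as above, is what allows the distributional context the abstract refers to (e.g., penalizing high-variance actions) to influence the policy.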

