April 24, 2023, 12:44 a.m. | Lie He, Shiva Prasad Kasiviswanathan

stat.ML updates on arXiv.org

In this paper, we study the conditional stochastic optimization (CSO) problem,
which covers a variety of applications, including portfolio selection,
reinforcement learning, robust learning, and causal inference. The
sample-averaged gradient of the CSO objective is biased due to its nested
structure and therefore incurs a high sample complexity to reach convergence.
We introduce a general stochastic extrapolation technique that effectively
reduces the bias. We show that for nonconvex smooth objectives, combining this
extrapolation with variance reduction techniques can achieve a …
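The bias the abstract refers to is easy to see numerically. Below is a minimal sketch on a toy CSO objective (a hypothetical example, not taken from the paper): with a nonlinear outer function, plugging a finite inner sample average into the gradient yields an estimator whose bias decays like O(1/m) in the inner sample size m, which is exactly what drives up sample complexity.

```python
import numpy as np

# Toy CSO objective (hypothetical, chosen only for illustration):
#   F(x) = f( E_eta[ g(x, eta) ] )  with  f(y) = y**3,  g(x, eta) = x + eta,
#   eta ~ N(0, 1).  The inner expectation is x, so F(x) = x**3 and F'(x) = 3 x**2.
# The plug-in ("sample-averaged") gradient replaces the inner expectation with a
# mean of m inner samples:  3 * (x + eta_bar)**2.  Since eta_bar ~ N(0, 1/m),
# its expectation is 3 x**2 + 3/m, i.e. the estimator is biased with bias 3/m.

def plugin_grad_bias(x, m, n_outer=200_000, seed=0):
    """Empirical bias of the plug-in gradient estimator at point x."""
    rng = np.random.default_rng(seed)
    # n_outer independent inner sample averages, each over m draws of eta
    eta_bar = rng.standard_normal((n_outer, m)).mean(axis=1)
    grads = 3.0 * (x + eta_bar) ** 2          # chain-rule plug-in gradient
    return grads.mean() - 3.0 * x ** 2        # true gradient is 3 x**2

x = 1.0
for m in (1, 10, 100):
    print(f"m={m:4d}  empirical bias={plugin_grad_bias(x, m):+.4f}  theory=+{3/m:.4f}")
```

The empirical bias tracks the theoretical 3/m closely, showing why naive sample averaging needs a large inner batch; a bias-reduction step such as the extrapolation proposed here attacks this term directly.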

