Implicit differentiation for fast hyperparameter selection in non-smooth convex learning. (arXiv:2105.01637v3 [stat.ML] UPDATED) | allainews.com

Aug. 10, 2022, 1:11 a.m. | Quentin Bertrand, Quentin Klopfenstein, Mathurin Massias, Mathieu Blondel, Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

stat.ML updates on arXiv.org arxiv.org

Finding the optimal hyperparameters of a model can be cast as a bilevel
optimization problem, typically solved using zero-order techniques. In this
work we study first-order methods when the inner optimization problem is convex
but non-smooth. We show that the forward-mode differentiation of proximal
gradient descent and proximal coordinate descent yield sequences of Jacobians
converging toward the exact Jacobian. Using implicit differentiation, we show
it is possible to leverage the non-smoothness of the inner problem to speed up
the computation. …

arxiv differentiation learning ml

More from arxiv.org / stat.ML updates on arXiv.org

Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions 20 hours ago | arxiv.org

abstract arxiv bias canonical +16

Dropout Regularization Versus $\ell_2$-Penalization in the Linear Model 20 hours ago | arxiv.org

abstract arxiv behavior convergence +15

Partial recovery and weak consistency in the non-uniform hypergraph Stochastic Block Model 20 hours ago | arxiv.org

abstract algorithm arxiv block +15

Estimating the Number of Components in Finite Mixture Models via Variational Approximation 20 hours ago | arxiv.org

abstract approximation arxiv bayes +11

Conformalized Ordinal Classification with Marginal and Conditional Coverage 20 hours ago | arxiv.org

abstract algorithm applications arxiv +16

Multi-Study R-Learner for Estimating Heterogeneous Treatment Effects Across Studies Using Statistical Machine Learning 1 day, 5 hours ago | arxiv.org

abstract arxiv effects machine +15

Spatial best linear unbiased prediction: A computational mathematics approach for high dimensional massive datasets 1 day, 5 hours ago | arxiv.org

abstract arxiv challenges classification +20

Estimation Sample Complexity of a Class of Nonlinear Continuous-time Systems 2 days, 20 hours ago | arxiv.org

abstract arxiv class complexity +14

Estimation and Uniform Inference in Sparse High-Dimensional Additive Models 2 days, 20 hours ago | arxiv.org

abstract arxiv confidence construct +9

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior Business Intelligence Developer / Analyst

@ Transamerica | Work From Home, USA

View on ai-jobs.net

Data Analyst (All Levels)

@ Noblis | Bethesda, MD, United States

View on ai-jobs.net