Nov. 3, 2022, 1:12 a.m. | Alberto Bemporad

cs.LG updates on arXiv.org

This paper investigates the use of extended Kalman filtering to train
recurrent neural networks with rather general convex loss functions and
regularization terms on the network parameters, including
$\ell_1$-regularization. We show that the learning method is competitive with
stochastic gradient descent on a nonlinear system identification benchmark
and in training a linear system with binary outputs. We also explore
the use of the algorithm in data-driven nonlinear model predictive control and
its relation to disturbance models for offset-free …
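The core idea the abstract describes is to treat the network parameters as the state of a random-walk system and update them with the EKF measurement equations, using the Jacobian of the network output with respect to the parameters. Below is a minimal NumPy sketch of that idea on a tiny RNN with synthetic data. It is a simplified stand-in, not the paper's algorithm: it uses a plain squared-error EKF, a finite-difference Jacobian, and holds the hidden state fixed during the update (the paper estimates hidden states and parameters jointly, and supports general convex losses and $\ell_1$-regularization, which are omitted here). All sizes, noise levels, and the data-generating system are illustrative assumptions.

```python
import numpy as np

# Tiny RNN: h_{k+1} = tanh(Wx*u_k + Wh @ h_k + b),  y_k = c @ h_{k+1}
nh = 4                                     # hidden-state size (illustrative)
ntheta = nh + nh * nh + nh + nh            # total number of parameters
rng = np.random.default_rng(0)

def unpack(theta):
    """Split the flat parameter vector into RNN weights (hypothetical layout)."""
    i = 0
    Wx = theta[i:i + nh]; i += nh
    Wh = theta[i:i + nh * nh].reshape(nh, nh); i += nh * nh
    b = theta[i:i + nh]; i += nh
    c = theta[i:i + nh]
    return Wx, Wh, b, c

def step(theta, h, u):
    """One forward step of the RNN: new hidden state and scalar output."""
    Wx, Wh, b, c = unpack(theta)
    h_new = np.tanh(Wx * u + Wh @ h + b)
    return h_new, c @ h_new

def output_and_jac(theta, h, u, eps=1e-6):
    """Predicted output and finite-difference Jacobian dy/dtheta.
    The hidden state h is held fixed here, a simplification: the paper's
    EKF estimates hidden states and parameters jointly."""
    _, y0 = step(theta, h, u)
    H = np.zeros((1, ntheta))
    for j in range(ntheta):
        tp = theta.copy()
        tp[j] += eps
        _, yj = step(tp, h, u)
        H[0, j] = (yj - y0) / eps
    return y0, H

# Synthetic data from a hypothetical nonlinear system (stand-in benchmark)
T = 500
u = rng.uniform(-1.0, 1.0, T)
y = np.zeros(T)
s = 0.0
for k in range(T):
    s = 0.8 * np.sin(s) + 0.4 * u[k]
    y[k] = s + 0.02 * rng.standard_normal()

# EKF over the parameters, modeled as a random walk: theta_{k+1} = theta_k + w_k
theta = 0.1 * rng.standard_normal(ntheta)  # parameter estimate (the EKF "state")
P = np.eye(ntheta)                         # parameter error covariance
Q = 1e-6 * np.eye(ntheta)                  # random-walk process noise
R = 1e-2                                   # output noise variance
h = np.zeros(nh)

for k in range(T):
    P = P + Q                              # time update (random-walk model)
    yhat, H = output_and_jac(theta, h, u[k])
    S = float(H @ P @ H.T) + R             # innovation variance
    K = (P @ H.T).flatten() / S            # Kalman gain
    theta = theta + K * (y[k] - yhat)      # measurement update (squared loss)
    P = P - np.outer(K, (H @ P).flatten()) # covariance update, (I - K H) P
    h, _ = step(theta, h, u[k])            # propagate hidden state with new theta

print("final innovation magnitude:", abs(y[-1] - yhat))
```

One design note: the covariance matrix P plays the role of an adaptive, per-parameter step size, which is why EKF training can be competitive with stochastic gradient descent without manual learning-rate tuning; the cost is the O(ntheta^2) covariance update.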
