Jan. 7, 2022, 2:10 a.m. | Yatong Bai, Tanmay Gautam, Somayeh Sojoudi

cs.LG updates on arXiv.org

The non-convexity of the artificial neural network (ANN) training landscape
brings inherent optimization difficulties. While the traditional
back-propagation stochastic gradient descent (SGD) algorithm and its variants
are effective in certain cases, they can get stuck at spurious local minima
and are sensitive to initialization and hyperparameters. Recent work has shown
that training an ANN with ReLU activations can be reformulated as a
convex program, offering hope of globally optimizing interpretable ANNs.
However, naively solving the convex training formulation …
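To make the convex-reformulation claim concrete, below is a minimal sketch (not the authors' algorithm) in the style of the Pilanci–Ergen convex program for a two-layer ReLU network with squared loss and weight decay: the ReLU activation patterns D_i = diag(1[Xu >= 0]) are sampled (full enumeration is exponential in the data dimension, which is the source of the complexity the abstract alludes to), and a group-regularized convex problem is solved over per-pattern weight pairs. The data sizes, the regularization weight beta, the number of sampled patterns P, and the use of cvxpy are all illustrative assumptions.

# Illustrative sketch of a Pilanci-Ergen-style convex reformulation of
# two-layer ReLU network training; hyperparameters are placeholders.
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)
n, d = 20, 3                      # samples, input dimension (toy sizes)
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)
beta = 1e-3                       # weight-decay coefficient (assumed)

# Sample ReLU activation patterns D_i = diag(1[X u >= 0]); enumerating
# all patterns is exponential, so sampling is the usual heuristic.
P = 10
patterns = {tuple((X @ rng.standard_normal(d) >= 0).astype(int))
            for _ in range(P)}
D = [np.diag(list(p)) for p in patterns]

V = [cp.Variable(d) for _ in D]   # one (v_i, w_i) weight pair per pattern
W = [cp.Variable(d) for _ in D]
pred = sum(Di @ X @ (v - w) for Di, v, w in zip(D, V, W))
loss = 0.5 * cp.sum_squares(pred - y)
reg = beta * sum(cp.norm(v, 2) + cp.norm(w, 2) for v, w in zip(V, W))

# Cone constraints forcing each weight pair to respect its sign pattern.
cons = []
for Di, v, w in zip(D, V, W):
    S = 2 * Di - np.eye(n)
    cons += [S @ X @ v >= 0, S @ X @ w >= 0]

prob = cp.Problem(cp.Minimize(loss + reg), cons)
prob.solve()
print("convex objective (restricted to sampled patterns):", prob.value)

When all activation patterns are enumerated, the optimum of this convex program matches the global minimum of the regularized non-convex training objective, and network weights can be recovered from the optimal (v_i, w_i) pairs; restricting to sampled patterns, as above, trades that exactness for tractability.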
