DL Notes: Advanced Gradient Descent

Dec. 7, 2023, 8:08 a.m. | Luis Medina

Towards Data Science - Medium towardsdatascience.com

The main optimization algorithms used for training neural networks, explained and implemented from scratch in Python

In my previous article about gradient descent, I explained the basic concepts behind it and summarized the main challenges of this kind of optimization.

However, I only covered Stochastic Gradient Descent (SGD) and the “batch” and “mini-batch” implementations of gradient descent.
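
As a rough illustration of how those three variants relate, they can be written as one loop that differs only in how many samples feed each gradient estimate. The sketch below is not the article's code: the function name `minibatch_gd`, the noise-free linear-regression data, and the learning rates are all assumptions chosen to keep the example small.

```python
import numpy as np

def minibatch_gd(X, y, batch_size, lr=0.1, epochs=200, seed=0):
    """Fit y ~ X @ w by gradient descent on the mean squared error.

    batch_size == len(X)      -> "batch" gradient descent
    1 < batch_size < len(X)   -> "mini-batch" gradient descent
    batch_size == 1           -> stochastic gradient descent (SGD)
    """
    rng = np.random.default_rng(seed)
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    for _ in range(epochs):
        # Reshuffle once per epoch so batches differ between epochs.
        order = rng.permutation(n_samples)
        for start in range(0, n_samples, batch_size):
            idx = order[start:start + batch_size]
            residual = X[idx] @ w - y[idx]
            # Gradient of the mean squared error over the current batch.
            grad = 2.0 * X[idx].T @ residual / len(idx)
            w -= lr * grad
    return w

# Hypothetical toy data: 64 samples, 2 features, no noise.
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(64, 2))
true_w = np.array([1.5, -0.5])
y = X @ true_w

w_batch = minibatch_gd(X, y, batch_size=64)        # one update per epoch
w_mini = minibatch_gd(X, y, batch_size=16)         # four updates per epoch
w_sgd = minibatch_gd(X, y, batch_size=1, lr=0.01)  # 64 updates per epoch
```

On this toy problem all three settings should recover weights close to `true_w`. A smaller learning rate is used for the single-sample case because each of its gradient estimates is noisier.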

Other algorithms offer advantages in terms of convergence speed, robustness to “landscape” features (the vanishing gradient …

data science getting-started machine learning optimization-algorithms programming
