Aug. 30, 2022, 4:01 p.m. | Poulinakis Kon

Towards AI (pub.towardsai.net)

Is GELU the ReLU Successor?


Can we combine regularization and activation functions? In 2016, Dan Hendrycks and Kevin Gimpel published a paper exploring exactly that; it has since been updated four times. In it, the authors introduced a new activation function: the Gaussian Error Linear Unit (GELU).

Demystifying GELU

The motivation behind GELU is to bridge stochastic regularizers, such as dropout, with non-linearities, i.e., activation functions.
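Concretely, the paper defines GELU(x) = x · Φ(x), where Φ is the standard Gaussian CDF, so each input is weighted by its value relative to a standard normal distribution. Below is a minimal NumPy/SciPy sketch of the exact form and of the tanh approximation given in the paper; the function names are illustrative, not the paper's reference code.

```python
import numpy as np
from scipy.stats import norm

def gelu(x):
    # Exact GELU: weight the input by Phi(x), the standard Gaussian CDF.
    return x * norm.cdf(x)

def gelu_tanh(x):
    # Cheaper tanh approximation from the paper:
    # 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x^3)))
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))
```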

Dropout regularization stochastically multiplies a neuron’s …
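For contrast, here is a minimal sketch of the stochastic masking that dropout performs (an illustrative inverted-dropout implementation, not code from the article). The paper motivates GELU as the expected value of such a zero-one mask when the keep probability is made input-dependent, namely Φ(x).

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(x, p=0.5):
    # Multiply each activation by an input-independent 0/1 Bernoulli mask;
    # the 1/(1-p) scaling keeps the expected activation unchanged.
    mask = (rng.random(x.shape) >= p).astype(x.dtype)
    return x * mask / (1.0 - p)
```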

