all AI news
[D] Layernorm is just two projections and can be improved
Feb. 24, 2024, 6:19 p.m. | /u/mgostIH
Machine Learning www.reddit.com
You project onto the hyperplane where the mean (or sum) of components is null (e.g. in 3D x + y + z = 0) by removing the mean and then project onto the sphere of radius sqrt(D) by dividing by the standard deviation. …
components hyperplane machinelearning mean null parameters project thinking translation vector
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Principal Data Engineering Manager
@ Microsoft | Redmond, Washington, United States
Machine Learning Engineer
@ Apple | San Diego, California, United States