June 24, 2023, 5:25 a.m. | Hector Andres Mejia Vallejo

Towards Data Science - Medium towardsdatascience.com

Home-brewed machine learning model series

The companion repo is available here!

The curse of dimensionality is one major problem in machine learning. As the number of features increases, so does the complexity of the model. Moreover, if there is not enough training data, it results in overfitting.

In this entry, Principal Component Analysis (PCA) will be introduced. First, I will explain why too many features are a problem. Then, the math behind PCA and why it works. Additionally, PCA …

analysis companion complexity data data processing data science dimensionality dimensionality-reduction features home look machine machine learning machine learning model major overfitting principal-component the curse of dimensionality training training data

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne