March 9, 2023, 2 p.m. | AssemblyAI

AssemblyAI www.youtube.com

Python for AI Development Course #2: In this lesson, we learn how to explore, clean and prepare data for machine learning model training.

Get the code here: https://github.com/AssemblyAI/youtube-tutorials/blob/main/Data%20preparation%20and%20model%20training.ipynb

Get the data: https://www.kaggle.com/datasets/usdot/flight-delays

00:00 - Intro
01:22 - Types of data
02:00 - Data documentation
03:34 - Settings up the notebook
06:23 - First look at the data
09:41 - Missing values
20:24 - Outliers
24:46 - Issues with categorical values
30:26 - Preparing the target feature
35:06 - Final prep …

ai development assemblyai categorical course data deeplearning development documentation feature free learn look machine machine learning machinelearning machine learning model missing values notebook outliers pandas python training types values

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne