Aug. 6, 2023, 6:15 p.m. | /u/jesseparks13

Data Science www.reddit.com

I am really confused about what types of data exploration and inspection you are allowed to do on the whole dataset BEFORE setting aside the test set. In the end-to-end machine learning project demonstrated in Hands-On Machine Learning by Geron Aurelien, the author checks the following before setting aside a test set: 1) quantities of data points and null values, 2) value counts of each value in categorical columns, 3) statistical summary of numerical columns, including counts, mean, std, min, …

author checks data data exploration datascience dataset exploration machine machine learning project set test types

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US