Jan. 7, 2022, 11:35 p.m. | /u/Silly-XboxGamerTag

Data Science www.reddit.com

I was wondering what professional data analysts do when preparing and cleaning data.

Here is what I currently do:

1- remove duplicates

2- handle null values

3- melt using pandas to make the table cleaner

4- spot outliers and handle them
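
To make the four steps concrete, here is a minimal sketch in pandas. The data, the column names (`store`, `jan`, `feb`, `sales`, `month`) and the `clean` helper are all made up for illustration, and the choices inside it (median fill for nulls, the 1.5 × IQR rule for outliers) are just one common option, not the only way to do it.

```python
import pandas as pd

# Hypothetical wide-format data: one row per store, one column per month.
raw = pd.DataFrame(
    {
        "store": ["A", "A", "B", "C", "D", "E"],
        "jan": [10.0, 10.0, 12.0, None, 11.0, 13.0],
        "feb": [11.0, 11.0, 500.0, 9.0, 12.0, 10.0],
    }
)


def clean(df: pd.DataFrame) -> pd.DataFrame:
    """Illustrative helper (not a standard API) applying the four steps."""
    # 1- remove exact duplicate rows
    out = df.drop_duplicates()

    # 2- handle null values: fill each numeric column with its median
    #    (an assumption; dropping rows or other imputations are also common)
    numeric_cols = out.select_dtypes("number").columns
    out = out.fillna(out[numeric_cols].median())

    # 3- melt from wide to long: one row per (store, month) pair
    long = out.melt(id_vars="store", var_name="month", value_name="sales")

    # 4- spot outliers with the 1.5 * IQR rule and flag them
    #    (flagging rather than deleting keeps the decision reversible)
    q1, q3 = long["sales"].quantile([0.25, 0.75])
    iqr = q3 - q1
    in_range = long["sales"].between(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
    long["is_outlier"] = ~in_range
    return long


cleaned = clean(raw)
print(cleaned[cleaned["is_outlier"]])
```

Flagging outliers in a separate column instead of dropping them leaves room to inspect them first and decide later whether they are errors or real values.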

What else can I do?

Also, I watched someone remove some fields (columns) because they weren't important enough to include. How do people determine what's important to keep and what's not?


