April 25, 2022, 5:32 p.m. | /u/eyeeyecaptainn

Data Science www.reddit.com

I have to build a data pipeline (my first ever). I have data stored locally in csv files. I have to process and analyze and visualize these data.

I thought of using python/pandas. However the questions i need to answer require for me to create new dataframes with the data i’ve been provided. I was thinking if it’s good practice to load up the data in an SQL database and then create new tables required for the solution and get …

csv data database datascience sql sql database

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analytics & Insight Specialist, Customer Success

@ Fortinet | Ottawa, ON, Canada

Account Director, ChatGPT Enterprise - Majors

@ OpenAI | Remote - Paris