Oct. 11, 2023, 5:14 p.m. | /u/every_other_freackle

Data Science www.reddit.com

Let’s say you get an ad hoc task that will take an hour or two. You run an sql query extract the data from the db dump into .csv spin up a quick Jupyter notebook and be done with it. But what happens after?


How would you store/archive this project?
Committing Jupyter notebooks to a repo? Now you have bunch of html in your codebase. Code that’s impossible to pull request/review that also bloats the repo. If you clear the …

csv data datascience extract hour jupyter notebook project query spin sql sql query

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Engineer - New Graduate

@ Applied Materials | Milan,ITA

Lead Machine Learning Scientist

@ Biogen | Cambridge, MA, United States