Continuous Integration for Data Science | allainews.com

March 26, 2024, 1:33 a.m. | /u/databot_

Data Science www.reddit.com

Hi r/datascience!

https://preview.redd.it/pp9vll5o0lqc1.png?width=800&format=png&auto=webp&s=e3639b7a1e01e98e854b152f93e32b7c410ca608

Over the last decade, I've participated in a dozen data science projects in industry. When projects hit production, it's critical to have unit and integration tests to prevent pushing faulty features or models. I've [summarized my learnings in a blog post](https://ploomber.io/blog/ci-for-ds/), here's the summary:

1. Structure your pipeline in several tasks, each one saving intermediate results to disk
2. Implement your pipeline in such a way that you can parametrize it
3. The first parameter …

change continuous data data science datascience integration intermediate location pipeline raw results sample saving science tasks testing

More from www.reddit.com / Data Science

Need Advice on Handling High-Dimensional Data in Data Science Project 12 hours ago | www.reddit.com

advice apply categorical data +11

Should I take the new offer? 14 hours ago | www.reddit.com

academic academic research analyst current +12

Where did you go to look for jobs? 21 hours ago | www.reddit.com

datascience fire google go to +8

Live Coding & Experimental Design Interview Questions 1 day, 5 hours ago | www.reddit.com

a/b testing b testing coding datascience +11

The Two Step SCM: A Tool for Data Scientists 1 day, 13 hours ago | www.reddit.com

causal causal inference code control +15

DS job market in EU 1 day, 14 hours ago | www.reddit.com

colleagues data datascience data scientist +7

MOMENT: A Foundation Model for Time Series Forecasting, Classification, Anomaly Detection and Imputation 1 day, 16 hours ago | www.reddit.com

anomaly anomaly detection building carnegie mellon +17

Best advice for mid-career? 1 day, 16 hours ago | www.reddit.com

advice big big pharma career +14

How does your model tracking framework looks like? 1 day, 18 hours ago | www.reddit.com

case datascience framework infrastructure +6

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Field Sample Specialist (Air Sampling) - Eurofins Environment Testing – Pueblo, CO

@ Eurofins | Pueblo, CO, United States

View on ai-jobs.net

Camera Perception Engineer

@ Meta | Sunnyvale, CA

View on ai-jobs.net