March 29, 2022, 5:42 p.m. | Andrew Engel

Towards Data Science - Medium towardsdatascience.com

Time series aggregations on the modern data stack

Photo by Donald Wu on Unsplash

In many business data science problems, data with a time series character (e.g., transactions, sensor readings, etc.) must be aggregated to an individual (e.g., customer, piece of equipment, etc.) level. Doing this in a modern machine learning environment can lead to trouble as traditional train-test and cross-validation splits will fail. This is because randomly splitting observations between training and test sets will almost always split related …

data data preparation data science engineering feature engineering time time series time-series-analysis

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Cint | Gurgaon, India

Data Science (M/F), setor automóvel - Aveiro

@ Segula Technologies | Aveiro, Portugal