March 24, 2024, 4:35 p.m. | /u/Asleep-Dress-3578

Data Science www.reddit.com

I work in a team of data scientists on time series forecasting pipelines, and I have the feeling that my colleagues overuse OOP paradigms. Let us say we have two dataframes, and we have a function which calculates some deltas between them:

import pandas as pd

def calculate_delta(df1: pd.DataFrame, df2: pd.DataFrame) -> pd.DataFrame:
    delta = ...  # some calculations
    return delta

delta = calculate_delta(df1, df2)
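
To make this concrete, here is a minimal sketch of what such a function might look like, assuming the delta is simply an element-wise difference between two frames that share the same index and columns. The subtraction and the sample data are assumptions for illustration only, not the actual calculation from the pipeline.

```python
import pandas as pd


def calculate_delta(df1: pd.DataFrame, df2: pd.DataFrame) -> pd.DataFrame:
    # Assumption: "delta" means an element-wise difference. pandas aligns
    # both frames on index and column labels before subtracting.
    return df2 - df1


# Hypothetical sample data: two forecasts covering the same three dates.
dates = pd.date_range("2024-01-01", periods=3, freq="D")
df1 = pd.DataFrame({"sales": [10.0, 12.0, 11.0]}, index=dates)
df2 = pd.DataFrame({"sales": [11.0, 12.0, 14.0]}, index=dates)

delta = calculate_delta(df1, df2)
```

Everything the function needs arrives through its arguments, so it can be called, tested, and reused without any setup.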

What my colleagues usually do is wrap this function in a class, something like:

class DeltaCalculatorProcessor:
    def …
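
The snippet above is cut off, so the following is a purely hypothetical sketch of what this kind of wrapper tends to look like, not the actual code being described. The method names (`__init__`, `process`), the element-wise subtraction, and the sample data are all assumptions. It is only meant to show that a class holding nothing but its inputs produces the same result as the plain function.

```python
import pandas as pd


def calculate_delta(df1: pd.DataFrame, df2: pd.DataFrame) -> pd.DataFrame:
    # Assumption: the delta is an element-wise difference.
    return df2 - df1


class DeltaCalculatorProcessor:
    """Hypothetical wrapper: stores the inputs and exposes one method."""

    def __init__(self, df1: pd.DataFrame, df2: pd.DataFrame) -> None:
        self.df1 = df1
        self.df2 = df2

    def process(self) -> pd.DataFrame:
        # Delegates to the same calculation as the plain function.
        return calculate_delta(self.df1, self.df2)


# Hypothetical sample data.
df1 = pd.DataFrame({"sales": [10.0, 12.0]})
df2 = pd.DataFrame({"sales": [11.0, 15.0]})

# Class style: construct an object, then call its single method.
delta_from_class = DeltaCalculatorProcessor(df1, df2).process()

# Function style: one call, no object to construct.
delta_from_function = calculate_delta(df1, df2)
```

In this sketch the object carries no state beyond the two arguments, so the class adds a construction step without adding behavior.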
