May 10, 2023, 7:45 p.m. | Wei Yi

Towards Data Science - Medium towardsdatascience.com

How distributed data parallel DDP and distributed model parallel DMP works in stochastic gradient descent with large models and huge data

data data science deep-dives deep learning distributed distributed data gradient gradient-descent large models model-parallelism pytorch reading science stochastic

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Quantexa | Sydney, New South Wales, Australia

Staff Analytics Engineer

@ Warner Bros. Discovery | NY New York 230 Park Avenue South