Jan. 4, 2022, 11:59 p.m. | /u/Yard1PL

Machine Learning www.reddit.com

tl;dr: train PyTorch models on large tabular datasets with a scikit-learn (skorch) API

Hi r/MachineLearning,

I'm the principal author of ray-skorch, a library for running distributed PyTorch training on large-scale datasets through a familiar, scikit-learn-compatible skorch API that integrates well with the rest of the scikit-learn ecosystem.

Under the hood, ray-skorch uses Ray Train for distributed PyTorch training and Ray Data for handling and shuffling large datasets.
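
To make the general idea concrete, here is a minimal, single-process sketch of data-parallel training behind a scikit-learn-style `fit`/`predict` interface. It is not ray-skorch's API and it uses neither Ray nor PyTorch: the class name `ToyDataParallelRegressor`, its parameters, and the toy dataset are all hypothetical, chosen only for illustration. Each simulated "worker" computes a gradient on its own shard of the shuffled data, and the gradients are averaged before every update, which is roughly the pattern a distributed trainer automates across processes.

```python
import random


class ToyDataParallelRegressor:
    """Hypothetical toy estimator: linear regression with simulated data-parallel SGD."""

    def __init__(self, num_workers=4, lr=0.5, epochs=500, seed=0):
        self.num_workers = num_workers
        self.lr = lr
        self.epochs = epochs
        self.seed = seed

    def _predict_one(self, x):
        return sum(w * xj for w, xj in zip(self.coef_, x)) + self.intercept_

    def _shard_gradient(self, shard):
        # Mean-squared-error gradient computed on one worker's shard only.
        grad_w = [0.0] * len(self.coef_)
        grad_b = 0.0
        for x, y in shard:
            err = self._predict_one(x) - y
            for j, xj in enumerate(x):
                grad_w[j] += 2.0 * err * xj / len(shard)
            grad_b += 2.0 * err / len(shard)
        return grad_w, grad_b

    def fit(self, X, y):
        rng = random.Random(self.seed)
        rows = list(zip(X, y))
        self.coef_ = [0.0] * len(X[0])
        self.intercept_ = 0.0
        for _ in range(self.epochs):
            # Shuffle the whole dataset, then deal rows out to the workers.
            rng.shuffle(rows)
            shards = [rows[i::self.num_workers] for i in range(self.num_workers)]
            grads = [self._shard_gradient(s) for s in shards if s]
            # Average the per-worker gradients (stand-in for an all-reduce step).
            avg_w = [
                sum(g[0][j] for g in grads) / len(grads)
                for j in range(len(self.coef_))
            ]
            avg_b = sum(g[1] for g in grads) / len(grads)
            self.coef_ = [w - self.lr * gw for w, gw in zip(self.coef_, avg_w)]
            self.intercept_ -= self.lr * avg_b
        return self

    def predict(self, X):
        return [self._predict_one(x) for x in X]


# Toy tabular data following y = 3x + 1 (made up for this sketch).
X = [[i / 19] for i in range(20)]
y = [3.0 * row[0] + 1.0 for row in X]

model = ToyDataParallelRegressor().fit(X, y)
# The learned coef_ and intercept_ should approach 3.0 and 1.0.
print(model.coef_, model.intercept_)
```

Because the estimator follows the usual `fit`/`predict` convention, calling code does not need to know how many workers were involved, which is the appeal of putting a scikit-learn-style interface in front of a distributed trainer.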

ray-skorch works only with tabular data. Currently, …

