all AI news
[P] What incremental unsolved problems are there in scaling machine learning training (distributed systems/Ray/data parallelism)?
April 8, 2024, 12:29 a.m. | /u/stereotypical_CS
Machine Learning www.reddit.com
I'm trying to find a distributed systems problem that isn't fully solved in Ray, or can be optimized. I'm not looking for a solution that will be the best for every scenario, but maybe a small tradeoff improvement that can be made (ex: trade off accuracy …
accuracy distributed distributed systems etc every improvement isn machinelearning ray recovery small solution systems trade will
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Business Data Scientist, gTech Ads
@ Google | Mexico City, CDMX, Mexico
Lead, Data Analytics Operations
@ Zocdoc | Pune, Maharashtra, India