March 2, 2023, 2:54 a.m. | Craig Smith

Eye On A.I. www.eye-on.ai

In this episode, Ben Sorscher, a PhD student at Stanford, sheds light on the challenges posed by the ever-increasing size of data sets used to train machine learning models, specifically large language models. The sheer size of these data sets has been pushing the limits of scaling, as the cost of training and the environmental impact of the electricity they consume becomes increasingly enormous.

As a solution, Ben discusses the concept of “data pruning” - a method of reducing the …

artificial intelligence ben sorscher data pruning data reduction data sets efficiency environmental impact large language models machine learning model-performance phd resource efficiency scaling stanford sustainability training costs

More from www.eye-on.ai / Eye On A.I.

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Global Data Architect, AVP - State Street Global Advisors

@ State Street | Boston, Massachusetts

Data Engineer

@ NTT DATA | Pune, MH, IN