March 12, 2024, 11:16 p.m. | /u/Barchimede

Machine Learning www.reddit.com

Hi all,

I wanted to make a post to launch a discussion about an interesting paper called [Datasheets for Datasets](https://www.microsoft.com/en-us/research/uploads/prod/2019/01/1803.09010.pdf) (> 1900 citations, so a big impact in the research community). Briefly explained, this paper discusses the lack of a standardized way for documenting a dataset to train ML models and proposes to document datasets with a 'datasheet,' similar to electronic components (or Warhammer figs, or any other items you know with datasheets). The authors illustrate this datasheet system with …

ai engineer colleagues computer computer vision conversations current datasets engineer experience information leads machinelearning paper phd role vision world

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne