March 12, 2024, 11:16 p.m. | /u/Barchimede

Machine Learning www.reddit.com

Hi all,

I wanted to make a post to launch a discussion about an interesting paper called [Datasheets for Datasets](https://www.microsoft.com/en-us/research/uploads/prod/2019/01/1803.09010.pdf) (> 1900 citations, so a big impact in the research community). Briefly explained, this paper discusses the lack of a standardized way for documenting a dataset to train ML models and proposes to document datasets with a 'datasheet,' similar to electronic components (or Warhammer figs, or any other items you know with datasheets). The authors illustrate this datasheet system with …

ai engineer colleagues computer computer vision conversations current datasets engineer experience information leads machinelearning paper phd role vision world

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US