June 23, 2022, 4:10 p.m. | /u/data_dan_

Data Science www.reddit.com

There are so many aggregators/repositories of public data out there. Some of the ones I use the most are:
- Kaggle
- UCI Machine Learning Repository
- Hugging Face Datasets
- the Data is Plural Newsletter/Site

For those of you who use these types of public data sources...what do you look for? Organization and ease of use? Trustworthiness? Documentation? A process for routinely updating from the original source? Something else?

data datascience public public data

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Principal Data Engineering Manager

@ Microsoft | Redmond, Washington, United States

Machine Learning Engineer

@ Apple | San Diego, California, United States