March 9, 2022, 3:37 a.m. | /u/popcornylu

Machine Learning www.reddit.com

Hello [r/MachineLearning](https://www.reddit.com/r/MachineLearning/),

I would like to share a tool we just released for versioning large data files.

How do you version your data for machine learning? Do you just tar your data and put it in S3? Use a shared folder on NFS? Or use a Git-like solution such as DVC or Git LFS?

Undoubtedly, putting large files in S3 (or a similar object store) or on NFS is the most common solution. However, when it comes to version control, Git is the de facto solution. But Git …

machinelearning version control
