all AI news
Sampling vs Dask for large datasets
July 4, 2022, 10:32 p.m. | /u/xanga_ghost
Data Science www.reddit.com
per the title, has anyone found it preferable to sample from a large dataset instead of training from a dask dataframe comprised of all the data?
and if so, have you found there to be a heavy tradeoff in model quality? my main gripe with dask is that the .compute() methods seem to take forever.
i am still very green to it all, so apologies if …
More from www.reddit.com / Data Science
Moving to eBay as a Data Science Analyst?
1 day, 8 hours ago |
www.reddit.com
Impact of different tool use on future job prospects
1 day, 12 hours ago |
www.reddit.com
How do you prepare for performance reviews?
1 day, 12 hours ago |
www.reddit.com
What’s the deal with minimum 3 YOE on most of job postings?
1 day, 15 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne