July 4, 2022, 10:32 p.m. | /u/xanga_ghost

Data Science www.reddit.com

i need someone with more grooves in their brain to help me out here.

per the title, has anyone found it preferable to sample from a large dataset instead of training on a dask dataframe comprising all the data?

and if so, have you found there to be a heavy tradeoff in model quality? my main gripe with dask is that calls to .compute() seem to take forever.
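for concreteness, here's roughly the pattern i'm asking about (a minimal sketch; the parquet path, target column, sample fraction, and model choice are all made up):

```python
import dask.dataframe as dd
from sklearn.ensemble import RandomForestClassifier

# lazy read; nothing is loaded into memory yet
ddf = dd.read_parquet("data/*.parquet")

# sample ~1% of rows, then materialize just that sample as a
# pandas DataFrame -- this is the only .compute() in the pipeline
sample = ddf.sample(frac=0.01, random_state=42).compute()

X = sample.drop(columns=["target"])
y = sample["target"]

# fit an ordinary in-memory model on the small sample
model = RandomForestClassifier(n_jobs=-1)
model.fit(X, y)
```

the appeal being that .compute() runs once on a small fraction instead of materializing everything.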

i am still very green to it all, so apologies if …

dask, datascience, datasets, large datasets, sampling
