all AI news
Spark, Dask, DuckDB, Polars: TPC-H Benchmarks at Scale
Nov. 8, 2023, 1:56 p.m. | /u/mrocklin
Data Science www.reddit.com
[https://youtu.be/wKH0-zs2g\_U](https://youtu.be/wKH0-zs2g_U)
This is the result of a couple weeks of work comparing large data frameworks on benchmarks ranging in size 10GB to 10TB. No project wins. It's really interesting analyzing results though.
DuckDB and Dask are the only projects that reliably finish …
arrow benchmarks dask data datascience devs duckdb event frameworks fun nyc projects recording scale spark talk thought work
More from www.reddit.com / Data Science
Survival Analysis Question (For Attrition Prediction)
1 day, 7 hours ago |
www.reddit.com
How to transition to machine learning engineering?
1 day, 9 hours ago |
www.reddit.com
Offer from an org that is mostly operating in excel
1 day, 19 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Data Analyst (Digital Business Analyst)
@ Activate Interactive Pte Ltd | Singapore, Central Singapore, Singapore