all AI news
Spark, Dask, DuckDB, Polars: TPC-H Benchmarks at Scale
Nov. 8, 2023, 1:56 p.m. | /u/mrocklin
Data Science www.reddit.com
[https://youtu.be/wKH0-zs2g\_U](https://youtu.be/wKH0-zs2g_U)
This is the result of a couple weeks of work comparing large data frameworks on benchmarks ranging in size 10GB to 10TB. No project wins. It's really interesting analyzing results though.
DuckDB and Dask are the only projects that reliably finish …
arrow benchmarks dask data datascience devs duckdb event frameworks fun nyc projects recording scale spark talk thought work
More from www.reddit.com / Data Science
Have Data Scientist Interviews Evolved Over the Last Year?
1 day, 15 hours ago |
www.reddit.com
Tell me about older individual contributors
1 day, 20 hours ago |
www.reddit.com
Pedro Thermo Similarity vs Levenshtain/ OSA/ Jaro/ ..
1 day, 21 hours ago |
www.reddit.com
Struggling on where to plug Python into my workflow
1 day, 22 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US