Aug. 23, 2022, 2:08 p.m. | Kevin Kho

Towards Data Science - Medium towardsdatascience.com

Examining the limitations of the SQL interface

Written by Kevin Kho and Han Wang

This is a written version of our most recent Spark Data + AI Summit talk.

Shiba Inu Piloting an Airplane — Image by Author

SQL-like Frameworks for Distributed Computing

In our last article, we talked about the limitations of using the Pandas interface for distributed computing. Some people quickly assumed that we are pro-SQL, but that is not exactly true either. Here, we’ll look …

