June 13, 2022, 1:53 p.m. | Pan Cretan

Towards Data Science - Medium towardsdatascience.com

Is the PySpark API really missing key functionality?


PySpark offers a fluent API that covers most needs. Still, experienced pandas users may find that some data transformations are less straightforward than they expect. This article provides a small number of recipes for use cases that some users may consider not natively supported by the PySpark API. In reality they are supported, but they require somewhat more effort (and imagination).
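
To make the idea concrete, here is a minimal sketch of one such transformation, an equivalent of pandas `melt` (unpivot), which appears among this article's tags. It is not necessarily the recipe the article itself uses: it relies on the Spark SQL `stack` generator through `DataFrame.selectExpr`, and the helper names `build_stack_expr` and `melt`, as well as the default output column names, are hypothetical. It assumes simple column names without quotes or backticks, and value columns that share a compatible type.

```python
def build_stack_expr(value_cols, var_name="variable", value_name="value"):
    """Build a Spark SQL stack() expression that turns columns into rows.

    Hypothetical helper: the name and defaults are illustrative only.
    """
    if not value_cols:
        raise ValueError("value_cols must not be empty")
    # Each column contributes a (label, value) pair: 'col', `col`
    pairs = ", ".join(f"'{c}', `{c}`" for c in value_cols)
    return f"stack({len(value_cols)}, {pairs}) as (`{var_name}`, `{value_name}`)"


def melt(df, id_cols, value_cols, var_name="variable", value_name="value"):
    """Unpivot value_cols of a PySpark DataFrame into long format.

    Assumes df exposes selectExpr, as pyspark.sql.DataFrame does.
    """
    quoted_ids = [f"`{c}`" for c in id_cols]
    stack_expr = build_stack_expr(value_cols, var_name, value_name)
    # Identifier columns are kept as-is; stack() emits one row per value column
    return df.selectExpr(*quoted_ids, stack_expr)
```

With a DataFrame `df` holding columns `id`, `q1` and `q2`, calling `melt(df, ["id"], ["q1", "q2"])` would select `id` alongside `stack(2, 'q1', q1, 'q2', q2)`, yielding one row per original row and value column. PySpark 3.4 and later also provide `DataFrame.unpivot` (alias `melt`) directly, which postdates this article.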

We start …

map melt pandas pyspark recipes unpivot
