all AI news
Does Pyspark have more detailed summary statistics beyond .describe and .summary?
I calculated the frequency distribution manually, but it would be helpful if there was a function to give you that and more. I'm searching but not seeing much.
Is there a particular Pyspark library I should be looking at? Thanks.
More from www.reddit.com / Data Science
@ Cepal Hellas Financial Services S.A. | Athens, Sterea Ellada, Greece
Senior Manager Data Engineering
@ Publicis Groupe | Bengaluru, India
Senior Data Modeler
@ Sanofi | Hyderabad
VP, Product Management - Data, AI & ML
@ Datasite | USA - MN - Minneapolis
Supervisão de Business Intelligence (BI)
@ Publicis Groupe | São Paulo, Brazil
Data Manager Advertising (f|m|d) (80-100%) - Zurich - Hybrid Work
@ SMG Swiss Marketplace Group | Zürich, Switzerland