Aug. 31, 2022, 2:40 p.m. | Angelica Lo Duca

Towards Data Science - Medium towardsdatascience.com

Data Preprocessing

A tutorial on how to build data pipelines using VDK to handle missing values


VMware recently released Versatile Data Kit (VDK), a framework for data ingestion and data processing. VDK simplifies complex operations, such as ingesting data from different sources, using either SQL or Python. In other words, you can use VDK to build data lakes, where you ingest raw …

