July 8, 2022, 9:06 p.m. | Yousry Mohamed

Towards Data Science - Medium towardsdatascience.com

Idempotent Writes to Delta Lake Tables

Walkthrough using open source Delta Lake

https://unsplash.com/photos/JI0KxozvOtQ

Introduction

According to Wikipedia:

Idempotence is the property of certain operations in mathematics and computer science whereby they can be applied multiple times without changing the result beyond the initial application.
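To make the definition concrete, here is a minimal Python sketch that contrasts a plain append with a write that is keyed by a batch id. The helper names (`append_rows`, `write_batch_once`) and the in-memory list standing in for a table are hypothetical and purely illustrative; they are not Delta Lake or Spark APIs.

```python
# Toy illustration only: these helpers are hypothetical, not Delta Lake APIs.

def append_rows(table, rows):
    """Non-idempotent: every call adds the rows again."""
    table.extend(rows)


def write_batch_once(table, seen_batches, batch_id, rows):
    """Idempotent: a batch id that was already applied is skipped."""
    if batch_id in seen_batches:
        return False  # retry detected, nothing is written
    table.extend(rows)
    seen_batches.add(batch_id)
    return True


# A retried plain append duplicates the data.
plain_table = []
append_rows(plain_table, ["a", "b"])
append_rows(plain_table, ["a", "b"])

# A retried keyed write leaves the table unchanged after the first call.
safe_table, seen = [], set()
first = write_batch_once(safe_table, seen, 1, ["a", "b"])
retry = write_batch_once(safe_table, seen, 1, ["a", "b"])
```

In this sketch the second call with the same batch id changes nothing, which is exactly the "no change beyond the initial application" behaviour the definition describes.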

Non-technical examples include elevator call buttons and crosswalk buttons. In software, an idempotent API is a critical requirement in many situations. One such situation is Spark Structured Streaming. Structured streaming …

