Oct. 11, 2022, 7:27 p.m. | Dustin Liu

Towards Data Science - Medium towardsdatascience.com

In 15 minutes or less

Photo by Etienne Girardet on Unsplash

Context

AWS Athena is a serverless service intended for ad-hoc SQL queries against data stored in AWS S3. However, maintaining data lineage and dependencies by hand is tedious and error-prone (the same is true of data warehouses).

DBT (Data Build Tool) has recently become extremely popular because it can automatically draw data lineage and generate documentation for a data pipeline, not to mention its other features such as Snapshot and Jinja and Macro support.

There are tons of …

