May 7, 2022, 12:01 p.m. | Vivek Chaudhary

Towards AI - Medium pub.towardsai.net

The objective of this blog is to understand how to build a Data Engineering pipeline using Julia Lang. There are a lot of blogs on the internet that talk about the theory of DE, and here I am going to talk CODE, less theory more code.

Import necessary libraries and dataset

using CSV
using DataFrames
using Dates
#load the datasets into memory
emp_df = CSV.read("D:\\Julia\\emp.csv", DataFrame)
print(typeof(emp_df))
#first()to access top 5 records
first(emp_df,5)
dept_df = CSV.read("D:\\Julia\\dept.csv", DataFrame)
print(typeof(dept_df))
first(dept_df,5)

Data …

code data data engineering engineering etl-pipeline julia julialang programming

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Robotics Technician - Weekend Day Shift

@ GXO Logistics | Hillsboro, OR, US, 97124

Gen AI Developer

@ NTT DATA | Irving, TX, US

Applied AI/ML - Vice President

@ JPMorgan Chase & Co. | LONDON, United Kingdom

Research Fellow (Computer Science/Engineering/AI)

@ Nanyang Technological University | NTU Main Campus, Singapore

Senior Machine Learning Engineer

@ Rasa | Remote - Germany