Nov. 21, 2023, 10 a.m. | Anais Dotis-Georgiou

InfoWorld Analytics www.infoworld.com



Historically, working with big data has been quite a challenge. Companies that wanted to tap big data sets faced significant performance overhead relating to data processing. Specifically, moving data between different tools and systems required leveraging different programming languages, network protocols, and file formats. Converting this data at each step in the data pipeline was costly and inefficient.

Enter Apache Arrow, an open-source framework that defines an in-memory columnar data format that every analytical processing engine can use.

To …

analytics apache apache arrow arrow big big data challenge companies data database data pipeline data processing data sets influxdb languages moving network performance pipeline processing programming programming languages sql systems tools

Machine Learning Postdoctoral Fellow

@ Lawrence Berkeley National Lab | Berkeley, Ca

Team Lead Data Integrity

@ Maximus | Remote, United States

Machine Learning Research Scientist

@ Bosch Group | Pittsburgh, PA, United States

Data Engineer

@ Autodesk | APAC - India - Bengaluru - Sunriver

Data Engineer II

@ Mintel | Belfast

Data Engineer

@ Vector Limited | Auckland, New Zealand