all AI news
Different file formats, a benchmark doing basic operations
DEV Community dev.to
Recently, I've been designing a data lake to store different types of data from various sources, catering to diverse demands across different areas and levels. To determine the best file type for storing this data, I compiled points of interest, considering the needs and demands of different areas. These points include:
Tool Compatibility
Tool compatibility refers to which tools can write and read a specific file type. No/low code tools are crucial, especially when tools like Excel/LibreOffice play a significant …
basic benchmark data dataengineering data lake datascience designing diverse file lake operations spark store tool type types