Aug. 9, 2022, 7:14 p.m. | Aaditya Bhat

Towards Data Science - Medium towardsdatascience.com

Build a geo-specific subset of LAION-5B

Photo by Dennis Kummer on Unsplash

Introduction to LAION-5B

The Large-scale Artificial Intelligence Open Network (LAION) is a non-profit organization that makes machine learning resources available to the general public. Recently, LAION released a dataset of 5.85 billion image-text pairs collected from the internet. The LAION-5B dataset contains image URLs and their associated text, along with a KNN index.
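To give a rough sense of what a KNN (k-nearest-neighbor) index does, here is a minimal sketch in plain Python. It is not LAION's implementation: the three-dimensional vectors, the example.com URLs, the captions, and the function names are all made up for illustration, and a real index over billions of embeddings would use an approximate search structure rather than the brute-force scan shown here.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def knn_search(query, index, k=2):
    """Return the k entries whose embeddings are most similar to the query.

    Each index entry is (embedding, url, caption). This is a brute-force
    scan, which is only practical for tiny toy indexes.
    """
    scored = [
        (cosine_similarity(query, embedding), url, caption)
        for embedding, url, caption in index
    ]
    # Highest similarity first.
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:k]


# Hypothetical toy index: embeddings, URLs, and captions are invented.
TOY_INDEX = [
    ([0.9, 0.1, 0.0], "https://example.com/paris.jpg", "Eiffel Tower at dusk"),
    ([0.1, 0.9, 0.0], "https://example.com/cat.jpg", "A cat on a sofa"),
    ([0.8, 0.2, 0.1], "https://example.com/london.jpg", "Tower Bridge in London"),
]

# A made-up query embedding that points toward the "landmark" direction.
results = knn_search([1.0, 0.0, 0.0], TOY_INDEX, k=2)
for score, url, caption in results:
    print(f"{score:.3f}  {url}  {caption}")
```

In this toy setup the two landmark entries rank above the cat photo, because their embeddings point in nearly the same direction as the query.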

The KNN index powers a search engine called clip retrieval that enables users to explore the LAION-5B dataset interactively. Clip retrieval provides …

