May 19, 2023, 11:41 a.m. | /u/Misterion777

Machine Learning www.reddit.com

Hey, I faced an unusual task and I'm not sure how to implement it.

Let's say I have a DB with a lot of images (possibly including duplicates). First, I calculate an embedding for each of them with some encoder (which one is irrelevant) and then apply a clustering algorithm to these embeddings. The most important part is that I need to assign a cluster ID to each image.
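For concreteness, here is a minimal sketch of that initial batch step. The post doesn't say which encoder or clustering algorithm is used, so everything here is an assumption: the toy 3-dimensional vectors stand in for real embeddings, and greedy "leader" clustering with a cosine-similarity threshold stands in for whatever algorithm is actually applied. The names `assign_cluster_ids` and `threshold` are made up for illustration.

```python
import math


def cosine_similarity(a, b):
    # Assumes both vectors are non-zero.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def assign_cluster_ids(embeddings, threshold=0.9):
    """Greedy leader clustering (a stand-in, not the post's algorithm).

    Each embedding joins the first cluster whose leader is at least
    `threshold` cosine-similar to it; otherwise it starts a new cluster.
    Returns a dict mapping image ID -> cluster ID.
    """
    leaders = []      # one representative embedding per cluster
    cluster_ids = {}  # image ID -> cluster ID
    for image_id, vector in embeddings.items():
        for cid, leader in enumerate(leaders):
            if cosine_similarity(vector, leader) >= threshold:
                cluster_ids[image_id] = cid
                break
        else:
            # No existing cluster is close enough: open a new one.
            leaders.append(vector)
            cluster_ids[image_id] = len(leaders) - 1
    return cluster_ids


# Toy stand-ins for encoder output (hypothetical values).
embeddings = {
    "img_a": [1.0, 0.0, 0.0],
    "img_a_dup": [1.0, 0.0, 0.0],   # exact duplicate of img_a
    "img_b": [0.0, 1.0, 0.0],
    "img_b_near": [0.05, 0.99, 0.0],
}

cluster_ids = assign_cluster_ids(embeddings)
for image_id, cid in cluster_ids.items():
    print(image_id, cid)
```

With these toy vectors the duplicate lands in the same cluster as its original, which is the property that matters for the DB described above. A real setup would likely swap in a library clustering implementation and a much higher embedding dimension.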

Now, the tricky part: new images keep coming into the system and I want …

