July 10, 2023, 4:56 p.m. | /u/alkibijad

Machine Learning www.reddit.com

I’m using a vector database for storing image embeddings and using it for similarity search. If I pick top ten most similar vectors I can sometimes end up inside of an echo chamber with almost “duplicates” or too similar images. I would like to diversify the results so that all the results are close to the input vector but different between themselves.

Are there common patterns/algorithms for this type of diversification?

The idea that I have: I want to pick …

database echo embeddings image images inside machinelearning search vector vector database vectors

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote