April 18, 2023, 6:01 p.m. | Rodrigo Agundez

Towards AI - Medium pub.towardsai.net

ChunkDot support for sparse matrices


In my previous blog post, I introduced ChunkDot, a library that performs multi-threaded matrix multiplication and cosine similarity. ChunkDot is suited to finding the K most similar items for each of a large number of items: it chunks the item matrix representation (embeddings) and uses Numba to accelerate the calculations.
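To make the chunking idea concrete, here is a minimal sketch in plain NumPy. It is not ChunkDot's implementation or API: the function name `top_k_cosine_chunked`, the `chunk_size` parameter, and the choice to exclude each item from its own neighbour list are assumptions made for illustration. It also works on dense arrays and uses no Numba. The point it shows is that only one block of similarities of shape (chunk size × number of items) is held in memory at a time.

```python
import numpy as np


def top_k_cosine_chunked(embeddings, top_k, chunk_size=1024):
    """Return (indices, scores) of the top_k most similar rows for every row.

    Illustrative sketch only; the name and parameters are hypothetical.
    Requires top_k < number of rows.
    """
    # Normalize rows so that a dot product equals the cosine similarity.
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    normalized = embeddings / np.clip(norms, 1e-12, None)

    n_items = normalized.shape[0]
    indices = np.empty((n_items, top_k), dtype=np.int64)
    scores = np.empty((n_items, top_k), dtype=normalized.dtype)

    for start in range(0, n_items, chunk_size):
        end = min(start + chunk_size, n_items)
        # Similarities of this chunk of rows against all items.
        sims = normalized[start:end] @ normalized.T
        # Assumption: an item should not appear as its own neighbour.
        sims[np.arange(end - start), np.arange(start, end)] = -np.inf
        # Unordered top_k per row, then sort those k by descending score.
        part = np.argpartition(sims, -top_k, axis=1)[:, -top_k:]
        part_scores = np.take_along_axis(sims, part, axis=1)
        order = np.argsort(-part_scores, axis=1)
        indices[start:end] = np.take_along_axis(part, order, axis=1)
        scores[start:end] = np.take_along_axis(part_scores, order, axis=1)

    return indices, scores
```

For example, `top_k_cosine_chunked(embeddings, top_k=10, chunk_size=5000)` on a 100,000-row matrix would hold at most a 5,000 × 100,000 block of similarities at once, instead of the full 100,000 × 100,000 matrix.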

Cosine Similarity for 1 Trillion Pairs of Vectors

I described how ChunkDot works under the hood and I showed some …
