Jan. 26, 2024, 1:33 a.m. | James Briggs

James Briggs www.youtube.com

OpenAI's new embedding models are text-embedding-3-small and text-embedding-3-large. These models are better than Ada 002 (text-embedding-ada-002), and we have the option of latency and storage-optimized text-embedding-3-smallor the higher accuracy text-embedding-3-large.

Key takeaways here are the pretty huge performance gains for multilingual embeddings — measured by the leap from 31.4% to 54.9% on the MIRACL benchmark. For English-language performance, we look at MTEB and see a smaller but still significant increase from 61% to 64.6%.

It's worth noting that the max …

accuracy ada benchmark embedding embedding models embeddings key latency multilingual openai performance small storage text

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne