Nov. 3, 2023, 12:57 p.m. | /u/davidmezzetti

Machine Learning www.reddit.com



https://preview.redd.it/mq8y8as0q4yb1.jpg?width=1374&format=pjpg&auto=webp&s=41a28968c929bf5a44fab03bef67991319e5728b

txtai is an all-in-one embeddings database for semantic search, LLM orchestration and language model workflows.

txtai has built-in support for quantizing vectors. The code above shows how 1-bit (binary) quantization can be applied. With 1-bit quantization, each dimension is transformed into a 1-bit value (0 or 1). Those bits are grouped into uint8's. This method can retain a surprising amount of accuracy, especially with high dimension models.

See the article below for more information along with benchmarks.

Article: …

binary code database embeddings language language model llm machinelearning orchestration quantization search semantic shows support value vector vectors workflows

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

C003549 Data Analyst (NS) - MON 13 May

@ EMW, Inc. | Braine-l'Alleud, Wallonia, Belgium

Marketing Decision Scientist

@ Meta | Menlo Park, CA | New York City