s
April 18, 2024, 12:39 a.m. |

Simon Willison's Weblog simonwillison.net

mistralai/mistral-common


New from Mistral: mistral-common, an open source Python library providing "a set of tools to help you work with Mistral models".


So far that means a tokenizer! This is similar to OpenAI's tiktoken library in that it lets you run tokenization in your own code, which crucially means you can count the number of tokens that you are about to use - useful for cost estimates but also for cramming the maximum allowed tokens in the context window for …

ai anthropic code count generativeai library llms mistral openai open source promptengineering python rag set tokenization tokens tools work

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Tableau/PowerBI Developer (A.Con)

@ KPMG India | Bengaluru, Karnataka, India

Software Engineer, Backend - Data Platform (Big Data Infra)

@ Benchling | San Francisco, CA