Oct. 15, 2023, 2:30 a.m. | /u/Particular_Flower_12

Machine Learning www.reddit.com

i am a little puzzled,

1. i know that transformers is the HF framework/library to load infere and train models easily
2. and that llama.cpp is another framework/library that does the more of the same but specialized in models that runs on CPU and quanitized and run much faster
3. i understand that GGML is a file format for saving model parameters in a single file, that its an old problematic format, and GGUF is the new kid on the …

cpp cpu faster framework library llama machinelearning transformers

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

AI Engineering Manager

@ M47 Labs | Barcelona, Catalunya [Cataluña], Spain