Aug. 2, 2023, 3:49 p.m. | Jonathan Apple

Towards Data Science - Medium towardsdatascience.com

A toy example of bulk inference on commodity hardware using Python, via llama.cpp and PySpark.

Image by author via DALL-E

Why?

This exercise is about using Llama 2, an LLM (Large Language Model) from Meta AI, to summarize many documents at once. The scalable summarization of unstructured, semi-structured, and structured text can exist as a feature by itself, and also be part of data pipelines that feed into downstream machine learning models.

Specifically, we want to prove the …

apache spark generative-ai llama 2 llm nlp

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne