LLMLingua: Innovating LLM efficiency with prompt compression

Dec. 7, 2023, 5 p.m. | Alyssa Hughes

Advanced prompting technologies for LLMs can lead to excessively long prompts, causing issues. Learn how LLMLingua compresses prompts up to 20x, maintaining quality, reducing latency, and supporting improved UX.

The post LLMLingua: Innovating LLM efficiency with prompt compression appeared first on Microsoft Research.

advanced compression efficiency latency learn llm llms microsoft microsoft research prompt prompting prompts quality research research blog technologies

Visit resource

More from www.microsoft.com / Microsoft Research

What’s Your Story: Jacki O’Neill 4 days, 3 hours ago | www.microsoft.com

africa expand good her +10

Research Focus: Week of May 13, 2024 4 days, 22 hours ago | www.microsoft.com

applications blog code community +20

Microsoft at CHI 2024: Innovations in human-centered design 5 days ago | www.microsoft.com

computer design human human-computer interaction +12

RASCAL: Novel robotics for scalable and highly available automated storage and retrieval 6 days ago | www.microsoft.com

automated availability challenges design +10

Enhanced autoscaling with VASIM: Vertical Autoscaling Simulator Toolkit 1 week ago | www.microsoft.com

adjusting algorithms cloud cost +15

MatterSim: A deep-learning model for materials under real-world conditions 1 week ago | www.microsoft.com

challenge design digital digital transformation +13

LLM profiling guides KV cache optimization 1 week, 5 days ago | www.microsoft.com

cache data guides key +15

LoftQ: Reimagining LLM fine-tuning with smarter initialization 1 week, 6 days ago | www.microsoft.com

ai technology computational efficiency energy +11

Abstracts: May 6, 2024 2 weeks ago | www.microsoft.com

benchmark capabilities create data +13

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

all AI news

LLMLingua: Innovating LLM efficiency with prompt compression

More from www.microsoft.com / Microsoft Research

Jobs in AI, ML, Big Data

Software Engineer for AI Training Data (School Specific)

Software Engineer for AI Training Data (Python)

Software Engineer for AI Training Data (Tier 2)

Data Engineer

Artificial Intelligence – Bioinformatic Expert

Lead Developer (AI)