June 14, 2024, 4:42 a.m. | Guanghui Qin, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme

cs.CL updates on arXiv.org

arXiv:2310.02409v2 Announce Type: replace
Abstract: Transformer-based language models (LMs) are inefficient in long contexts. We propose Dodo, a solution for context compression. Instead of one vector per token in a standard transformer model, Dodo represents text with a dynamic number of hidden states at each layer, reducing the cost of self-attention to a fraction of the typical time and space. Moreover, off-the-shelf models such as LLaMA can be adapted to Dodo by parameter-efficient tuning methods such as LoRA. In use, …
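The abstract's core idea is to let later layers attend over far fewer hidden states than input tokens. A minimal sketch of that idea is below, using a fixed-size cross-attention pooling rather than Dodo's dynamic per-layer selection; the class name `ContextCompressor` and all hyperparameters are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class ContextCompressor(nn.Module):
    """Hypothetical sketch: pool T token states into K << T summary states via
    cross-attention, so downstream self-attention runs over K states instead of T.
    (Dodo itself uses a *dynamic* number of states per layer; this is fixed-K.)"""
    def __init__(self, d_model: int, num_summaries: int, num_heads: int = 8):
        super().__init__()
        # Learned query vectors that pool the full token sequence.
        self.queries = nn.Parameter(torch.randn(num_summaries, d_model) * 0.02)
        self.attn = nn.MultiheadAttention(d_model, num_heads, batch_first=True)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, T, d_model) -> compressed: (batch, K, d_model)
        batch = hidden_states.size(0)
        q = self.queries.unsqueeze(0).expand(batch, -1, -1)
        compressed, _ = self.attn(q, hidden_states, hidden_states)
        return compressed

# Usage: 1024 token states pooled into 64 summary states, shrinking the
# quadratic self-attention cost by roughly (1024 / 64)^2 = 256x.
compressor = ContextCompressor(d_model=512, num_summaries=64)
tokens = torch.randn(2, 1024, 512)
print(compressor(tokens).shape)  # torch.Size([2, 64, 512])
```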

