AutoChunk: Automated Activation Chunk for Memory-Efficient Long Sequence Inference

March 5, 2024, 2:45 p.m. | Xuanlei Zhao, Shenggan Cheng, Guangyang Lu, Jiarui Fang, Haotian Zhou, Bin Jia, Ziming Liu, Yang You

cs.LG updates on arXiv.org

arXiv:2401.10652v2 Announce Type: replace-cross
Abstract: Large deep learning models have achieved impressive performance across a range of applications. However, their large memory requirements, including parameter memory and activation memory, have become a significant challenge for their practical serving. While existing methods mainly address parameter memory, the importance of activation memory has been overlooked. Especially for long input sequences, activation memory is expected to grow exponentially as the sequence length increases. In this approach, we propose AutoChunk, …
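To make the notion of activation chunking concrete, here is a minimal sketch in plain Python. It is not AutoChunk's method, which the truncated abstract does not describe. It only illustrates the general idea, assuming a toy self-attention layer as the workload. The function names (`attention_block`, `chunked_attention`) and the toy input are hypothetical. The sketch processes the queries in blocks, so only a `chunk_size x n` score matrix is live at once instead of the full `n x n` matrix.

```python
import math


def attention_block(q_rows, keys, values):
    """Softmax attention for one block of query rows.

    Returns the block's outputs and the number of score entries that were
    materialized, as a stand-in for intermediate activation memory.
    """
    # Intermediate activation: a (len(q_rows) x len(keys)) score matrix.
    scores = [[sum(a * b for a, b in zip(q, k)) for k in keys] for q in q_rows]
    dim = len(values[0])
    out = []
    for row in scores:
        row_max = max(row)  # subtract the max for numerical stability
        weights = [math.exp(s - row_max) for s in row]
        total = sum(weights)
        out.append([sum(w * v[d] for w, v in zip(weights, values)) / total
                    for d in range(dim)])
    return out, len(scores) * len(keys)


def chunked_attention(queries, keys, values, chunk_size):
    """Run attention over the queries in blocks of `chunk_size` rows."""
    outputs, peak_entries = [], 0
    for start in range(0, len(queries), chunk_size):
        block = queries[start:start + chunk_size]
        block_out, n_entries = attention_block(block, keys, values)
        outputs.extend(block_out)
        peak_entries = max(peak_entries, n_entries)
    return outputs, peak_entries


# Toy input (hypothetical): 8 tokens with 4 features each.
n, d = 8, 4
x = [[math.sin(i * d + j) for j in range(d)] for i in range(n)]

full_out, full_peak = chunked_attention(x, x, x, chunk_size=n)   # one block
chunk_out, chunk_peak = chunked_attention(x, x, x, chunk_size=2)  # four blocks
```

In this toy case the unchunked call materializes 64 score entries at once and the chunked call at most 16, while both return the same outputs. Chunking trades peak intermediate memory for more, smaller passes over the same computation.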
