March 5, 2024, 2:42 p.m. | Yu Mao, Weilan Wang, Hongchao Du, Nan Guan, Chun Jason Xue

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.01384v1 Announce Type: new
Abstract: Deploying Large Language Models (LLMs) on edge or mobile devices offers significant benefits, such as enhanced data privacy and real-time processing capabilities. However, it also faces critical challenges due to the substantial memory requirement of LLMs. Quantization is an effective way of reducing the model size while maintaining good performance. However, even after quantization, LLMs may still be too big to fit entirely into the limited memory of edge or mobile devices and have to …

abstract arxiv benefits capabilities challenges cs.ai cs.cl cs.lg data data privacy devices edge language language models large language large language models llms memory mobile mobile devices privacy processing quantization real-time real-time processing type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US