CBQ: Cross-Block Quantization for Large Language Models
Feb. 5, 2024, 6:43 a.m. | Xin Ding, Xiaoyu Liu, Zhijun Tu, Yun Zhang, Wei Li, Jie Hu, Hanting Chen, Yehui Tang, Zhiwei X
cs.LG updates on arXiv.org arxiv.org