NOLA: Compressing LoRA using Linear Combination of Random Basis
May 1, 2024, 4:46 a.m. | Soroush Abbasi Koohpayegani, KL Navaneet, Parsa Nooralinejad, Soheil Kolouri, Hamed Pirsiavash
Source: cs.CV updates on arXiv.org (arxiv.org)
Abstract: Fine-tuning Large Language Models (LLMs) and storing them for each downstream task or domain is impractical because of the massive model size (e.g., 350GB in GPT-3). Current literature, such as LoRA, showcases the potential of low-rank modifications to the original weights of an LLM, enabling efficient adaptation and storage for task-specific models. These methods can reduce the number of parameters needed to fine-tune an LLM by several orders of magnitude. Yet, these methods face two …
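The abstract is cut off before the method itself is described, so the following is only a minimal, hypothetical sketch of the idea named in the title: re-parameterizing the LoRA factors as linear combinations of frozen, seed-generated random basis matrices, so that only the mixing coefficients are trained and stored. The class and parameter names (NOLALinear, num_basis, rank) are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class NOLALinear(nn.Module):
    """Hypothetical NOLA-style adapter (illustrative, not the paper's code):
    the LoRA factors A (d x r) and B (r x k) are expressed as linear
    combinations of frozen random basis matrices regenerated from a seed,
    so only the mixing coefficients alpha and beta are trainable."""

    def __init__(self, base_linear: nn.Linear, rank: int = 4,
                 num_basis: int = 64, seed: int = 0):
        super().__init__()
        self.base = base_linear  # frozen pretrained layer
        for p in self.base.parameters():
            p.requires_grad_(False)

        d, k = base_linear.out_features, base_linear.in_features
        g = torch.Generator().manual_seed(seed)
        # Bases can be regenerated from the seed, so they need not be stored.
        self.register_buffer("A_basis", torch.randn(num_basis, d, rank, generator=g))
        self.register_buffer("B_basis", torch.randn(num_basis, rank, k, generator=g))

        # Only these coefficients are optimized (and later checkpointed).
        self.alpha = nn.Parameter(torch.zeros(num_basis))
        self.beta = nn.Parameter(torch.zeros(num_basis))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # A = sum_i alpha_i * A_i,  B = sum_j beta_j * B_j
        A = torch.einsum("i,idr->dr", self.alpha, self.A_basis)
        B = torch.einsum("j,jrk->rk", self.beta, self.B_basis)
        delta_w = A @ B  # low-rank weight update, as in LoRA
        return self.base(x) + x @ delta_w.t()

# Usage: wrap a pretrained layer and fine-tune only 2 * num_basis scalars.
layer = NOLALinear(nn.Linear(768, 768), rank=4, num_basis=64)
print(sum(p.numel() for p in layer.parameters() if p.requires_grad))  # 128
```

Under this sketch, the number of trainable (and stored) parameters per layer depends only on num_basis, decoupling storage cost from both the rank and the layer dimensions.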