Nov. 2, 2023, 5 p.m. | /u/skelly0311

Machine Learning www.reddit.com

I'm confused as to why people would use only a subset of weights as learnable parameters for LoRA. If only the attention weights get the low-rank updates, you still need the weights of all the other layers during backpropagation to compute the derivative of the loss with respect to the attention weights. That's how the chain rule works, so I don't see how it would help with memory consumption. Is there something I'm missing here?
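
For concreteness, here's a minimal sketch of the setup I mean (assuming PyTorch; the `LoRALinear` module below is just a toy illustration, not any particular library's API). The frozen weights are still used in the forward and backward passes, but only the small LoRA factors get gradient buffers and optimizer state:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen base weight W plus a trainable low-rank update B @ A."""
    def __init__(self, in_features, out_features, rank=4):
        super().__init__()
        # Base weight is frozen: no gradient buffer, no optimizer state.
        self.weight = nn.Parameter(torch.randn(out_features, in_features),
                                   requires_grad=False)
        # Only these small factors are learnable.
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, rank))

    def forward(self, x):
        return x @ (self.weight + self.lora_B @ self.lora_A).T

# Toy "block": LoRA on the attention projection, MLP fully frozen.
attn_proj = LoRALinear(16, 16)
mlp = nn.Linear(16, 16)
for p in mlp.parameters():
    p.requires_grad_(False)

x = torch.randn(2, 16)
loss = mlp(attn_proj(x)).sum()
loss.backward()

# The backward pass *does* use the frozen MLP weights (chain rule),
# but no per-parameter gradients are stored for them:
print(attn_proj.lora_A.grad is not None)  # True
print(mlp.weight.grad)                    # None

# Optimizer state (e.g. Adam's moment buffers) is allocated only for
# the trainable low-rank factors, not for the full frozen weights.
optimizer = torch.optim.Adam(
    [p for p in attn_proj.parameters() if p.requires_grad], lr=1e-3)
```

So the intermediate gradients needed by the chain rule do flow through the frozen layers either way; the question is whether that's the dominant memory cost compared to per-parameter gradients and optimizer state.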
