[D] - Why do Attention layers work so well? Don't weights in DNNs already tell the network how much weight/attention to give to a specific input? (High weight = lots of attention, low weight = little attention)

Oct. 2, 2022, 8:56 p.m. | /u/029187

Machine Learning www.reddit.com

So an attention layer has Q, K, and V vectors. My understanding is that the goal is to say, for a given query q, how relevant the value v is.

From this the network learns which data is relevant to focus on for a given input.
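To make the Q, K, V description concrete, here is a minimal sketch of scaled dot-product attention in plain Python. It rests on several assumptions: the vectors are passed in directly, whereas a real attention layer would typically produce them from the input through learned projections; the function names `softmax` and `attention` and the toy numbers are illustrative only. Note that in this formulation relevance is scored between the query and each key, and the resulting weights are then used to mix the values.

```python
import math

def softmax(xs):
    # Subtract the max before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over plain lists of vectors.

    Returns (outputs, weights); weights[i][j] is how much query i
    draws from value j.
    """
    d_k = len(keys[0])
    outputs, all_weights = [], []
    for q in queries:
        # Score each key against the query, scaled by sqrt(d_k).
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        out = [
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ]
        outputs.append(out)
        all_weights.append(weights)
    return outputs, all_weights

# Toy data (hypothetical): two keys with two matching values.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]

# Same keys and values, different queries.
out_a, w_a = attention([[4.0, 0.0]], keys, values)
out_b, w_b = attention([[0.0, 4.0]], keys, values)

print(w_a[0])  # most of the weight lands on the first value
print(w_b[0])  # most of the weight lands on the second value
```

In this sketch the mixing weights are recomputed from the query and keys on every call, so two different queries produce two different weightings over the same values. The learned weights of a dense layer, by contrast, stay fixed once training is done.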

But what I don't get is why this is effective. Don't DNNs already do this with weights? A neuron in a hidden layer can be set off by any arbitrary combination of inputs, so …


