Self attention with better complexity?
April 14, 2023, 4:24 p.m. | /u/abstract000
Deep Learning www.reddit.com
These last weeks I have been looking at papers that try to reduce self-attention complexity. The first was Longformer. While I love the idea in the paper, I think the implementation is currently impractical because it would need sparse tensors. We tried those at work and got no speedup unless the tensor is VERY sparse. If you have a good way to deal …
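To make the complexity argument concrete, here is a minimal numpy sketch (my own illustration, not code from the paper) contrasting standard O(n²) self-attention with a Longformer-style sliding-window variant that computes only O(n·w) scores. The function names and the window parameter `w` are assumptions for illustration:

```python
import numpy as np

def full_attention(q, k, v):
    # Standard self-attention: materializes the full O(n^2) score matrix.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

def windowed_attention(q, k, v, w=4):
    # Longformer-style sliding window: each token attends only to
    # neighbours within distance w, so only O(n * w) scores are computed.
    n, d = q.shape
    out = np.zeros_like(v)
    for i in range(n):
        lo, hi = max(0, i - w), min(n, i + w + 1)
        scores = q[i] @ k[lo:hi].T / np.sqrt(d)
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()
        out[i] = weights @ v[lo:hi]
    return out

rng = np.random.default_rng(0)
n, d = 16, 8
q, k, v = rng.standard_normal((3, n, d))
dense = full_attention(q, k, v)
local = windowed_attention(q, k, v, w=4)
```

Note that the loop here stands in for the sparse/banded matmul a real implementation would need; as the post observes, expressing this pattern with general sparse tensors tends not to beat the dense kernel unless the attention pattern is very sparse.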