Memory inside sequence models
March 15, 2024, 8:51 p.m. | /u/Sinestro101
Deep Learning | www.reddit.com
Setting aside the architectural differences between the plain RNN, the GRU, and the LSTM, memory is basically the input sequence processed through some mathematical function and passed along sequentially as input to the next time step (alongside the input x_t), acting as a prior representation of the data seen so far.
From this technical perspective, memory seems constrained to the length of the …
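To make the recurrence concrete, here is a minimal NumPy sketch of a plain (Elman) RNN step; the dimensions, the name rnn_step, and the random weights are illustrative assumptions, not anything from the post. The hidden state h is the only channel through which earlier time steps can influence later ones, which is exactly the sense in which "memory" is tied to sequential processing:

```python
import numpy as np

# Hypothetical dimensions, chosen only for illustration.
input_dim, hidden_dim = 8, 16
rng = np.random.default_rng(0)

# Parameters of a plain (Elman) RNN cell.
W_xh = rng.normal(scale=0.1, size=(hidden_dim, input_dim))   # input  -> hidden
W_hh = rng.normal(scale=0.1, size=(hidden_dim, hidden_dim))  # hidden -> hidden
b_h = np.zeros(hidden_dim)

def rnn_step(h_prev, x_t):
    """One time step: the new hidden state mixes the prior
    'memory' h_prev with the current input x_t."""
    return np.tanh(W_hh @ h_prev + W_xh @ x_t + b_h)

# Process a sequence: h is recomputed at every step, so everything
# the model 'remembers' must survive this repeated compression.
seq = rng.normal(size=(5, input_dim))  # 5 time steps
h = np.zeros(hidden_dim)               # empty memory at t = 0
for x_t in seq:
    h = rnn_step(h, x_t)
```

GRUs and LSTMs change how h is updated (gates, an extra cell state), but not this basic picture: memory is a fixed-size state threaded through the sequence one step at a time.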
More from www.reddit.com / Deep Learning
Stable LM 2 runs Offline on Android (Open Source)
1 day, 3 hours ago | www.reddit.com
What deep learning theory do we really need?
2 days, 3 hours ago | www.reddit.com
Classical ML interview
2 days, 21 hours ago | www.reddit.com
Talking face generation!!
3 days, 14 hours ago | www.reddit.com
Learning Deep Learning from scratch
4 days, 16 hours ago | www.reddit.com
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Research Scientist
@ Meta | Menlo Park, CA
Principal Data Scientist
@ Mastercard | O'Fallon, Missouri (Main Campus)