all AI news
xLSTM: Extended Long Short-Term Memory
June 1, 2024, 10:23 p.m. | Yannic Kilcher
Yannic Kilcher www.youtube.com
Paper: https://arxiv.org/abs/2405.04517
Abstract:
In the 1990s, the constant error carousel and gating were introduced as the central ideas of the Long Short-Term Memory (LSTM). Since then, LSTMs have stood the test of time and contributed to numerous deep learning success stories, in particular they constituted the first Large Language Models (LLMs). However, the advent of the …
abstract architecture error ideas long short-term memory lstm memory results scale test training transformers xlstm
More from www.youtube.com / Yannic Kilcher
Jobs in AI, ML, Big Data
VP, Enterprise Applications
@ Blue Yonder | Scottsdale
Data Scientist - Moloco Commerce Media
@ Moloco | Redwood City, California, United States
Senior Backend Engineer (New York)
@ Kalepa | New York City. Hybrid
Senior Backend Engineer (USA)
@ Kalepa | New York City. Remote US.
Senior Full Stack Engineer (USA)
@ Kalepa | New York City. Remote US.
Senior Full Stack Engineer (New York)
@ Kalepa | New York City., Hybrid