Dec. 6, 2023, 6:14 p.m. | /u/mcbal2666

Machine Learning www.reddit.com

A non-equilibrium statistical mechanics perspective on transformers.

We present a class of transformers based on mean-field dynamics of vector-spin models. Our framework supports asymmetric couplings and yields residual, attention, and feed-forward terms.

Post: https://mcbal.github.io/post/spin-model-transformers

Code (JAX): https://github.com/mcbal/spin-model-transformers

attention dynamics equilibrium framework machinelearning mean perspective residual spin statistical terms transformers vector

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Senior Applied Data Scientist

@ dunnhumby | London

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV