Dec. 6, 2023, 6:14 p.m. | /u/mcbal2666

Machine Learning www.reddit.com

A non-equilibrium statistical mechanics perspective on transformers.

We present a class of transformers based on mean-field dynamics of vector-spin models. Our framework supports asymmetric couplings and yields residual, attention, and feed-forward terms.

Post: https://mcbal.github.io/post/spin-model-transformers

Code (JAX): https://github.com/mcbal/spin-model-transformers

attention dynamics equilibrium framework machinelearning mean perspective residual spin statistical terms transformers vector

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Engineer - AWS

@ 3Pillar Global | Costa Rica

Cost Controller/ Data Analyst - India

@ John Cockerill | Mumbai, India, India, India