Feb. 12, 2024, 9:55 p.m. | /u/pikachuchameleon

Machine Learning www.reddit.com

Hi all, I am sharing our recent work on analyzing transformers via Markov chains. In particular, we design a framework for the systematic theoretical and empirical analysis of these models. The paper is here: [https://arxiv.org/abs/2402.04161](https://arxiv.org/abs/2402.04161)

Looking forward to your constructive feedback and comments! :)

