all AI news
Unveiling Transformers with LEGO: a synthetic reasoning task. (arXiv:2206.04301v1 [cs.LG])
June 10, 2022, 1:12 a.m. | Yi Zhang, Arturs Backurs, Sébastien Bubeck, Ronen Eldan, Suriya Gunasekar, Tal Wagner
cs.CL updates on arXiv.org arxiv.org
We propose a synthetic task, LEGO (Learning Equality and Group Operations),
that encapsulates the problem of following a chain of reasoning, and we study
how the transformer architecture learns this task. We pay special attention to
data effects such as pretraining (on seemingly unrelated NLP tasks) and dataset
composition (e.g., differing chain length at training and test time), as well
as architectural variants such as weight-tied layers or adding convolutional
components. We study how the trained models eventually succeed at …
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Research Assistant/Associate, Health Data Science [LKCMedicine]
@ Nanyang Technological University | NTU Novena Campus, Singapore
Senior Machine Learning Engineer, Portfolio ML
@ Affirm | Remote Canada
[Sessional Lecturer] Foundations of Data Analytics and Machine Learning - APS1070
@ University of Toronto | Toronto, ON, CA
Senior Data Scientist
@ Prosper | United States
Data Analyst
@ ZF Friedrichshafen AG | Coimbatore, TN, IN, 641659