May 1, 2024, 4:47 a.m. | James A. Michaelov, Catherine Arnett, Benjamin K. Bergen

cs.CL updates on arXiv.org

arXiv:2404.19178v1 Announce Type: new
Abstract: Transformers have supplanted Recurrent Neural Networks as the dominant architecture for both natural language processing tasks and, despite criticisms of cognitive implausibility, for modelling the effect of predictability on online human language comprehension. However, two recently developed recurrent neural network architectures, RWKV and Mamba, appear to perform natural language tasks comparably to or better than transformers of equivalent scale. In this paper, we show that contemporary recurrent models are now also able to match - …
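In work of this kind, "predictability" is typically operationalised as surprisal, the negative log-probability a language model assigns to each word given its preceding context, which is then related to human reading measures. The sketch below illustrates that computation with the Hugging Face `transformers` API; the checkpoint name (`gpt2`) is a placeholder assumption, not the RWKV or Mamba model evaluated in the paper.

```python
# Minimal sketch: per-token surprisal from a causal language model.
# Surprisal = -log2 P(token | left context) is the usual operationalisation
# of "predictability" in studies of online comprehension.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumption: swap in an RWKV or Mamba checkpoint for the paper's setting
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

sentence = "The children went outside to play."
inputs = tokenizer(sentence, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)

# Log-probability of each token given its left context.
log_probs = torch.log_softmax(logits, dim=-1)
input_ids = inputs["input_ids"]

# Token i is predicted from position i-1, so shift by one to align.
token_log_probs = log_probs[0, :-1].gather(
    1, input_ids[0, 1:].unsqueeze(-1)
).squeeze(-1)

# Convert from nats to bits.
surprisal = -token_log_probs / torch.log(torch.tensor(2.0))
for tok, s in zip(tokenizer.convert_ids_to_tokens(input_ids[0, 1:]), surprisal):
    print(f"{tok:>12s}  {s.item():6.2f} bits")
```

Per-token surprisals like these are what get regressed against reading times or N400 amplitudes when comparing architectures as cognitive models.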

