Jan. 25, 2024, 7:21 p.m. | /u/we_are_mammals

r/MachineLearning (www.reddit.com)

Large Language Models as General Pattern Machines

Full text: [https://arxiv.org/abs/2307.04721](https://arxiv.org/abs/2307.04721)

Abstract:

>We observe that pre-trained large language models (LLMs) are capable of autoregressively completing complex token sequences -- from arbitrary ones procedurally generated by probabilistic context-free grammars (PCFG), to more rich spatial patterns found in the Abstraction and Reasoning Corpus (ARC), a general AI benchmark, prompted in the style of ASCII art. Surprisingly, pattern completion proficiency can be partially retained even when the sequences are expressed using tokens randomly sampled from the vocabulary. These results …
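For a concrete sense of the setup the abstract describes, here is a minimal sketch (not the authors' code; the toy grammar, the token remapping, and the prompt format are all illustrative assumptions): it samples sequences from a small PCFG, relabels the terminals with arbitrary unrelated tokens, and assembles a few-shot completion prompt of the kind one might feed to a pre-trained LLM.

```python
import random

# Toy PCFG: each nonterminal maps to weighted productions.
# S -> A B | A S B yields nested a^n b^n strings, a classic CFG pattern.
# (This grammar is an illustrative assumption, not the paper's.)
PCFG = {
    "S": [(["A", "B"], 0.5), (["A", "S", "B"], 0.5)],
    "A": [(["a"], 1.0)],
    "B": [(["b"], 1.0)],
}

def sample(symbol="S"):
    """Recursively expand `symbol` into a list of terminal tokens."""
    if symbol not in PCFG:  # terminal symbol
        return [symbol]
    productions, weights = zip(*PCFG[symbol])
    chosen = random.choices(productions, weights=weights)[0]
    return [tok for sym in chosen for tok in sample(sym)]

# Relabel terminals with semantically unrelated tokens, in the spirit of
# the paper's random-vocabulary experiment (this mapping is made up).
remap = {"a": "lantern", "b": "42"}

# Few-shot prompt: several completed sequences, then a half-finished one
# for the LLM to continue autoregressively.
examples = [" ".join(remap[t] for t in sample()) for _ in range(4)]
partial = [remap[t] for t in sample()]
prompt = "\n".join(examples) + "\n" + " ".join(partial[: len(partial) // 2])
print(prompt)
```

The paper's observation, per the abstract, is that a pre-trained LLM given a prompt like this will often continue the final line consistently with the underlying nested structure, even though the surface tokens carry no meaning.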

