Jan. 25, 2024, 7:21 p.m. | /u/we_are_mammals

Machine Learning | www.reddit.com

Full text: [Large Language Models as General Pattern Machines (arXiv:2307.04721)](https://arxiv.org/abs/2307.04721)



Abstract:



>We observe that pre-trained large language models (LLMs) are capable of autoregressively completing complex token sequences -- from arbitrary ones procedurally generated by probabilistic context-free grammars (PCFG), to more rich spatial patterns found in the Abstraction and Reasoning Corpus (ARC), a general AI benchmark, prompted in the style of ASCII art. Surprisingly, pattern completion proficiency can be partially retained even when the sequences are expressed using tokens randomly sampled from the vocabulary. These results …

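The abstract is the only methodological detail in the post, but the setup it describes (in-context completion of PCFG-generated sequences, with the tokens optionally relabeled at random) is straightforward to sketch. The Python snippet below is a minimal, hypothetical illustration rather than the paper's code: the toy grammar, the `sample`/`remap` helpers, and the `tok{i}` vocabulary are assumptions made for the example, and the actual LLM call is left to whichever completion endpoint the reader uses.

```python
import random

# A toy probabilistic context-free grammar: each nonterminal maps to a list
# of (expansion, weight) choices. Purely illustrative -- not the grammars
# used in the paper.
PCFG = {
    "S": [(["A", "B"], 0.7), (["A", "S", "B"], 0.3)],
}

def sample(symbol="S", rng=random):
    """Expand `symbol` into a flat list of terminal tokens."""
    if symbol not in PCFG:
        return [symbol]  # terminal symbol
    expansions, weights = zip(*PCFG[symbol])
    chosen = rng.choices(expansions, weights=weights, k=1)[0]
    return [tok for sym in chosen for tok in sample(sym, rng)]

def remap(sequences, vocab, rng=random):
    """Relabel every distinct terminal with a randomly sampled vocabulary
    token, consistently across the whole prompt -- mimicking the paper's
    test of whether pattern completion survives arbitrary token identities."""
    terminals = sorted({tok for seq in sequences for tok in seq})
    mapping = dict(zip(terminals, rng.sample(vocab, len(terminals))))
    return [[mapping[tok] for tok in seq] for seq in sequences]

if __name__ == "__main__":
    rng = random.Random(0)
    vocab = [f"tok{i}" for i in range(100)]  # hypothetical stand-in vocabulary

    # Few-shot prompt: several completed sequences, then a truncated one
    # the model is asked to continue autoregressively.
    sequences = remap([sample(rng=rng) for _ in range(4)], vocab, rng)
    partial = sequences[-1][: len(sequences[-1]) // 2]

    prompt = "\n".join(" ".join(seq) for seq in sequences[:-1] + [partial])
    print(prompt)  # send to any completion-style LLM; compare its output to sequences[-1]
```

Note that the random relabeling is applied consistently across the whole prompt; relabeling each example independently would destroy the shared pattern the model is supposed to pick up in context.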
