Dec. 19, 2023, 5:58 p.m. | /u/Gaussian_Kernel

Machine Learning www.reddit.com

**Paper:** [**https://arxiv.org/pdf/2312.05571.pdf**](https://arxiv.org/pdf/2312.05571.pdf)

**Code:** [**https://github.com/joykirat18/SYRELM**](https://github.com/joykirat18/SYRELM)

**Abstract:** Large Language Models (LLM) exhibit zero-shot mathematical reasoning capacity as a behavior emergent with scale, commonly manifesting as chain-of-thoughts (CoT) reasoning. However, multiple empirical findings suggest that this prowess is exclusive to LLMs with exorbitant sizes (beyond 50 billion parameters). Meanwhile, educational neuroscientists suggest that symbolic algebraic manipulation be introduced around the same time as arithmetic word problems to modularize language-to-formulation, symbolic manipulation of the formulation, and endgame arithmetic. In this paper, we start with …
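The abstract describes splitting arithmetic word problems into three stages: language-to-formulation, symbolic manipulation of the formulation, and final arithmetic. A minimal toy sketch of that split is below, assuming a hypothetical `translate_to_formula` stand-in for the LM call; it is not the SYRELM code, just an illustration of keeping the arithmetic outside free-form text generation.

```python
# Toy sketch (not the paper's implementation): (1) turn a word problem into a
# symbolic formula over named variables, (2) evaluate that formula with an
# ordinary, deterministic evaluator instead of letting the LM do the arithmetic.

def translate_to_formula(problem: str) -> tuple[str, dict[str, float]]:
    """Stand-in for the language-to-formulation step (an LM would produce this).
    Returns a symbolic expression plus variable bindings. Hard-coded here as a
    hypothetical example output."""
    return "a * b + c", {"a": 4.0, "b": 3.0, "c": 2.0}

def evaluate_formula(expr: str, bindings: dict[str, float]) -> float:
    """Symbolic/arithmetic step, kept separate from the language model:
    substitute the bindings and compute the result exactly."""
    # Restricted eval namespace keeps this a self-contained toy evaluator.
    return float(eval(expr, {"__builtins__": {}}, dict(bindings)))

if __name__ == "__main__":
    problem = "A box holds 4 rows of 3 apples, plus 2 loose apples. How many apples?"
    expr, bindings = translate_to_formula(problem)
    print(expr, bindings, "=>", evaluate_formula(expr, bindings))  # 14.0
```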

