[R] Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models | allainews.com

April 14, 2024, 6:14 p.m. | /u/SeawaterFlows

Machine Learning www.reddit.com

**Paper**: [https://arxiv.org/abs/2404.03622](https://arxiv.org/abs/2404.03622)

**Abstract**:

>Large language models (LLMs) have exhibited impressive performance in language comprehension and various reasoning tasks. However, their abilities in spatial reasoning, a crucial aspect of human cognition, remain relatively unexplored. Humans possess a remarkable ability to create mental images of unseen objects and actions through a process known as **the Mind's Eye**, enabling the imagination of the unseen world. Inspired by this cognitive capacity, we propose **Visualization-of-Thought** (**VoT**) prompting. VoT aims to elicit spatial reasoning of LLMs …


