SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models | allainews.com

March 7, 2024, 5:42 a.m. | Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.03636v1 Announce Type: cross
Abstract: Spreadsheet manipulation is widely existing in most daily works and significantly improves working efficiency. Large language model (LLM) has been recently attempted for automatic spreadsheet manipulation but has not yet been investigated in complicated and realistic tasks where reasoning challenges exist (e.g., long horizon manipulation with multi-step reasoning and ambiguous requirements). To bridge the gap with the real-world requirements, we introduce $\textbf{SheetRM}$, a benchmark featuring long-horizon and multi-category tasks with reasoning-dependent manipulation caused by real-life …

agent arxiv cs.ai cs.lg language language models large language large language models manipulation reasoning spreadsheet type via

More from arxiv.org / cs.LG updates on arXiv.org

Gland Segmentation Via Dual Encoders and Boundary-Enhanced Attention 1 day, 2 hours ago | arxiv.org

abstract arxiv attention automated +8

Sliced Wasserstein with Random-Path Projecting Directions 1 day, 2 hours ago | arxiv.org

abstract applications arxiv cs.ai +12

TIM: An Efficient Temporal Interaction Module for Spiking Transformer 1 day, 2 hours ago | arxiv.org

arxiv cs.cv cs.lg cs.ne +3

Accuracy vs Memory Advantage in the Quantum Simulation of Stochastic Processes 1 day, 2 hours ago | arxiv.org

abstract accuracy arxiv assumptions +20

Accelerating Inference in Molecular Diffusion Models with Latent Representations of Protein Structure 1 day, 2 hours ago | arxiv.org

abstract arxiv biology cs.lg +18

Large Language Models can Strategically Deceive their Users when Put Under Pressure 1 day, 2 hours ago | arxiv.org

abstract agent arxiv behavior +11

Learning Extrinsic Dexterity with Parameterized Manipulation Primitives 1 day, 2 hours ago | arxiv.org

arxiv cs.lg cs.ro manipulation +1

The Un-Kidnappable Robot: Acoustic Localization of Sneaking People 1 day, 2 hours ago | arxiv.org

arxiv cs.lg cs.ro localization +3

Diffusion Models as Stochastic Quantization in Lattice Field Theory 1 day, 2 hours ago | arxiv.org

abstract arxiv cs.lg diffusion +15

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net