Sheetagent: Towards a generalist agent for spreadsheet reasoning and manipulation via large language models

Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao, Hangyu Mao, Fuzheng Zhang · 2025 · arXiv 6410.371496

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

baseline 1

citation-polarity summary

baseline 1

representative citing papers

BlueFin: Benchmarking LLM Agents on Financial Spreadsheets

cs.SE · 2026-05-29 · unverdicted · novelty 6.0

BlueFin is a new benchmark for LLM agents on financial spreadsheets showing frontier models score below 50% with weaknesses in dynamic correctness.

Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning

cs.AI · 2026-05-21 · unverdicted · novelty 6.0

Spreadsheet-RL applies RL fine-tuning and a custom Gym environment to raise LLM agent Pass@1 scores on spreadsheet benchmarks from roughly 8-12% to 17-23%.

Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning

cs.CL · 2026-04-14 · unverdicted · novelty 6.0

SpreadsheetAgent uses incremental multi-format reading, structural sketching, and verification to raise spreadsheet benchmark accuracy from 35.27% to 38.16%.

citing papers explorer

Showing 3 of 3 citing papers after filters.

BlueFin: Benchmarking LLM Agents on Financial Spreadsheets cs.SE · 2026-05-29 · unverdicted · none · ref 12
BlueFin is a new benchmark for LLM agents on financial spreadsheets showing frontier models score below 50% with weaknesses in dynamic correctness.
Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning cs.AI · 2026-05-21 · unverdicted · none · ref 5
Spreadsheet-RL applies RL fine-tuning and a custom Gym environment to raise LLM agent Pass@1 scores on spreadsheet benchmarks from roughly 8-12% to 17-23%.
Towards Robust Real-World Spreadsheet Understanding with Multi-Agent Multi-Format Reasoning cs.CL · 2026-04-14 · unverdicted · none · ref 5
SpreadsheetAgent uses incremental multi-format reading, structural sketching, and verification to raise spreadsheet benchmark accuracy from 35.27% to 38.16%.

Sheetagent: Towards a generalist agent for spreadsheet reasoning and manipulation via large language models

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer