pith. sign in

hub Canonical reference

SimpleTIR: End-to-end reinforcementlearningformulti-turntool-integratedreasoning.arXivpreprint

Canonical reference. 83% of citing Pith papers cite this work as background.

19 Pith papers citing it
Background 83% of classified citations

hub tools

citation-role summary

background 5 baseline 1

citation-polarity summary

years

2026 17 2025 2

representative citing papers

Training Multi-Image Vision Agents via End2End Reinforcement Learning

cs.CV · 2025-12-05 · unverdicted · novelty 7.0

IMAgent trains a multi-image vision agent via pure end-to-end RL with visual reflection tools and a two-layer motion trajectory masking strategy, reaching SOTA on single- and multi-image benchmarks while revealing tool-use effects on attention.

Harnessing LLM Agents with Skill Programs

cs.AI · 2026-05-18 · conditional · novelty 6.0

HASP upgrades textual skills into executable Program Functions that intervene in LLM agent loops at inference, post-training, or self-evolution, delivering 25% gains over ReAct and 30.4% over Search-R1 on reasoning benchmarks.

citing papers explorer

Showing 19 of 19 citing papers.