ACEBench: Who wins the match point in tool usage? arXiv preprint arXiv:2501.12851, 2025

7 Pith papers cite this work. Polarity classification is still indexing.

years: 2026 (6), 2025 (1)

roles: dataset (1)

polarities: use dataset (1)

representative citing papers

On Effectiveness and Efficiency of Agentic Tool-calling and RL Training

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

Tool-calling evaluations for LLM agents are highly sensitive to implementation details such as random seeds and history handling, and two new techniques accelerate RL training with wall-clock speedup and no performance degradation.

Entropy Polarity in Reinforcement Fine-Tuning: Direction, Asymmetry, and Control

cs.LG · 2026-05-12 · unverdicted · novelty 6.0 · 2 refs

Entropy polarity is a signed token-level quantity derived from a first-order approximation of entropy change that predicts whether RL updates expand or contract policy entropy in LLM fine-tuning, revealing an asymmetry between high- and low-probability tokens.
