pith. sign in

Title resolution pending

11 Pith papers cite this work, alongside 3,350 external citations. Polarity classification is still indexing.

11 Pith papers citing it
3,350 external citations · Crossref

citation-role summary

background 2 method 2

citation-polarity summary

years

2026 11

representative citing papers

Evaluating Plan Compliance in Autonomous Programming Agents

cs.SE · 2026-04-13 · unverdicted · novelty 7.0

Autonomous programming agents frequently fail to follow instructed plans, falling back on incomplete internalized workflows, while standard plans and periodic reminders improve performance but poor plans can degrade it more than no plan.

Evaluating LLM Agents on Automated Software Analysis Tasks

cs.SE · 2026-04-13 · unverdicted · novelty 7.0

A custom LLM agent achieves 94% manually verified success on a new benchmark of 35 software analysis setups, outperforming baselines at 77%, but struggles with stage mixing, error localization, and overestimating its own success.

citing papers explorer

Showing 11 of 11 citing papers.