pith. machine review for the scientific record.

Panoptic scene graph generation with semantics-prototype learning. AAAI, 38(4):3145–3153, Mar

3 Pith papers cite this work. Polarity classification is still indexing.

fields

cs.AI 3

years

2026 3

verdicts

UNVERDICTED 3

representative citing papers

Counterfactual Trace Auditing of LLM Agent Skills

cs.AI · 2026-05-12 · unverdicted · novelty 7.0

CTA framework detects 522 skill influence patterns in LLM agent traces across 49 tasks where average pass rate shifts only +0.3%, exposing evaluation gaps in behavioral effects like template copying and excess planning.

FORTIS: Benchmarking Over-Privilege in Agent Skills

cs.AI · 2026-05-09 · unverdicted · novelty 7.0 · 2 refs

FORTIS benchmark shows over-privilege is the norm in LLM agent skill selection and execution, with models reaching for higher-privilege skills and tools than required across ten frontier models and three domains.

citing papers explorer

  • Counterfactual Trace Auditing of LLM Agent Skills cs.AI · 2026-05-12 · unverdicted · none · ref 21

    CTA framework detects 522 skill influence patterns in LLM agent traces across 49 tasks where average pass rate shifts only +0.3%, exposing evaluation gaps in behavioral effects like template copying and excess planning.

  • FORTIS: Benchmarking Over-Privilege in Agent Skills cs.AI · 2026-05-09 · unverdicted · none · ref 21 · 2 links

    FORTIS benchmark shows over-privilege is the norm in LLM agent skill selection and execution, with models reaching for higher-privilege skills and tools than required across ten frontier models and three domains.

  • Geometry over Density: Few-Shot Cross-Domain OOD Detection cs.AI · 2026-05-05 · unverdicted · none · ref 28 · 3 links

    UFCOD extracts Path Energy and Dynamics Energy from diffusion trajectories to perform few-shot OOD detection across unrelated domains with one fixed model.