arXiv preprint arXiv:2502.05352 , year=

· 2025 · arXiv 2502.05352

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

read on arXiv browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Pooled Leaderboards Hide System-Specific Winners: A Reporting-Protocol Audit of Offline Root-Cause Analysis Benchmarks

cs.AI · 2026-06-28 · unverdicted · novelty 7.0

Pooled top-1 accuracy rankings in RCA benchmarks do not reliably identify per-subsystem winners, as pairwise comparisons across 11 subsystems show effects of both signs and leave-one-system-out selection incurs regret up to 24.8 pp.

LLMs Corrupt Your Documents When You Delegate

cs.CL · 2026-04-17 · unverdicted · novelty 6.0

LLMs corrupt an average of 25% of document content during long delegated editing workflows across 52 domains, even frontier models, and agentic tools do not mitigate the issue.

Auditable Graph-Guided Root Cause Analysis for Kubernetes Incidents

cs.SE · 2026-06-07 · conditional · novelty 5.0

Graph Traversal Agent improves root-cause F1 from 0.6087 to 0.9130 on ITBench snapshots but the gain is benchmark-coupled to cases where the injected fault is already in the evidence graph.

Runtime-Structured Task Decomposition for Agentic Coding Systems

cs.SE · 2026-05-14 · unverdicted · novelty 5.0

Runtime-structured task decomposition reduces retry costs in agentic coding systems by up to 51.7% versus monolithic prompts by rerunning only failed subtasks on two software engineering workloads.

From Assistance to Agency: Rethinking Autonomy and Control in CI/CD Pipelines

cs.SE · 2026-05-08 · unverdicted · novelty 5.0

The central challenge in AI-augmented CI/CD is designing authority transfer from humans to agents under constraints, as current systems remain limited to bounded data-plane autonomy backed by external governance.

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery

eess.SY · 2026-05-06 · unverdicted · novelty 5.0 · 2 refs

The paper introduces Experiment-as-Code Labs as a declarative stack synthesizing AI agents, systems orchestration, and physical lab control for AI-driven discovery.

citing papers explorer

Showing 0 of 0 citing papers after filters.

No citing papers match the current filters.

arXiv preprint arXiv:2502.05352 , year=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer