NetAgentBench is an FSM-based benchmark showing that state-of-the-art LLM agents solve basic network configs but suffer exploration meltdowns and coherence collapse on expert-level tasks.
A theory of timed automata
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
method 1polarities
use method 1representative citing papers
The authors introduce affine repulsive RL policies that provably satisfy hard affine state constraints for black-box hybrid dynamical systems with affine reset maps by deriving sufficient closed-loop safety conditions and testing on pendulum and juggler examples.
citing papers explorer
-
NetAgentBench: A State-Centric Benchmark for Evaluating Agentic Network Configuration
NetAgentBench is an FSM-based benchmark showing that state-of-the-art LLM agents solve basic network configs but suffer exploration meltdowns and coherence collapse on expert-level tasks.
-
Learning Control Policies to Provably Satisfy Hard Affine Constraints for Black-Box Hybrid Dynamical Systems
The authors introduce affine repulsive RL policies that provably satisfy hard affine state constraints for black-box hybrid dynamical systems with affine reset maps by deriving sufficient closed-loop safety conditions and testing on pendulum and juggler examples.