Testing of deep rein- forcement learning agents with surrogate models.ACM Transactions on Software Engineering and Methodology, 33(3):73:1–73:33

Matteo Biagiola, Paolo Tonella · 2024 · DOI 10.1145/3631970

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

Worst-Case Discovery and Runtime Protection for RL-Based Network Controllers

cs.NI · 2026-05-06 · unverdicted · novelty 7.0

ReGuard discovers network scenarios where RL controllers perform 43-64% worse than achievable and reduces those gaps by 79-85% with lightweight rule-based protection that preserves normal performance.

citing papers explorer

Showing 1 of 1 citing paper.

Worst-Case Discovery and Runtime Protection for RL-Based Network Controllers cs.NI · 2026-05-06 · unverdicted · none · ref 8
ReGuard discovers network scenarios where RL controllers perform 43-64% worse than achievable and reduces those gaps by 79-85% with lightweight rule-based protection that preserves normal performance.

Testing of deep rein- forcement learning agents with surrogate models.ACM Transactions on Software Engineering and Methodology, 33(3):73:1–73:33

fields

years

verdicts

representative citing papers

citing papers explorer