The paper gives the first tight necessity and sufficiency conditions for successful reward poisoning attacks in linear MDPs.
Title resolution pending
10 Pith papers cite this work. Polarity classification is still indexing.
years
2026 10verdicts
UNVERDICTED 10representative citing papers
Adam-HNAG is a splitting-based reformulation of Adam that yields the first convergence proof for Adam-type methods, including accelerated rates, in convex smooth optimization.
LLM-generated heuristics for HTN planning nearly match PANDA planner coverage while reducing search effort on 83% of shared problems across six benchmark domains.
FedQual improves federated label distribution learning under heterogeneous annotation quality via quality-adaptive training with a global anchor and reliability-aware aggregation, backed by new benchmarks and a proof that client-specific calibration strictly outperforms uniform calibration.
ABox abduction under repair semantics for inconsistent KBs yields a full complexity landscape in lightweight description logics DL-Lite and EL_bot.
DLM4G applies graph-aware adaptive noising in a diffusion framework to generate text from graphs, outperforming larger autoregressive and diffusion baselines in factual grounding and edit sensitivity on three datasets plus molecule captioning.
VISOR applies VLMs to automate robot test oracles for correctness and quality assessment while reporting uncertainty, with evaluation on GPT and Gemini showing trade-offs in precision and recall but poor uncertainty calibration.
Random slicing for subsampling combined with Nadaraya-Watson smoothing enables faster and improved persistence-based topological optimization of point clouds in 2D and 3D.
SACHI uses graph transformer convolutions on inter-agent coordination graphs to enrich partial-observation agents with content-dependent teammate information, yielding statistically significant gains over baselines in five cooperative tasks.
ResGIN-Att predicts drug synergy by extracting multi-scale molecular features with residual GIN, fusing them via LSTM, and modeling interactions with cross-attention, achieving competitive results on five benchmark datasets.
citing papers explorer
-
When Can You Poison Rewards? A Tight Characterization of Reward Poisoning in Linear MDPs
The paper gives the first tight necessity and sufficiency conditions for successful reward poisoning attacks in linear MDPs.
-
Adam-HNAG: A Convergent Reformulation of Adam with Accelerated Rate
Adam-HNAG is a splitting-based reformulation of Adam that yields the first convergence proof for Adam-type methods, including accelerated rates, in convex smooth optimization.
-
Hierarchical Task Network Planning with LLM-Generated Heuristics
LLM-generated heuristics for HTN planning nearly match PANDA planner coverage while reducing search effort on 83% of shared problems across six benchmark domains.
-
Trustworthy Federated Label Distribution Learning under Annotation Quality Disparity
FedQual improves federated label distribution learning under heterogeneous annotation quality via quality-adaptive training with a global anchor and reliability-aware aggregation, backed by new benchmarks and a proof that client-specific calibration strictly outperforms uniform calibration.
-
ABox Abduction for Inconsistent Knowledge Bases under Repair Semantics
ABox abduction under repair semantics for inconsistent KBs yields a full complexity landscape in lightweight description logics DL-Lite and EL_bot.
-
Factual and Edit-Sensitive Graph-to-Sequence Generation via Graph-Aware Adaptive Noising
DLM4G applies graph-aware adaptive noising in a diffusion framework to generate text from graphs, outperforming larger autoregressive and diffusion baselines in factual grounding and edit sensitivity on three datasets plus molecule captioning.
-
VISOR: A Vision-Language Model-based Test Oracle for Testing Robot
VISOR applies VLMs to automate robot test oracles for correctness and quality assessment while reporting uncertainty, with evaluation on GPT and Gemini showing trade-offs in precision and recall but poor uncertainty calibration.
-
Towards Scalable Persistence-Based Topological Optimization
Random slicing for subsampling combined with Nadaraya-Watson smoothing enables faster and improved persistence-based topological optimization of point clouds in 2D and 3D.
-
SACHI: Structured Agent Coordination via Holistic Information Integration in Multi-Agent Reinforcement Learning
SACHI uses graph transformer convolutions on inter-agent coordination graphs to enrich partial-observation agents with content-dependent teammate information, yielding statistically significant gains over baselines in five cooperative tasks.
-
Drug Synergy Prediction via Residual Graph Isomorphism Networks and Attention Mechanisms
ResGIN-Att predicts drug synergy by extracting multi-scale molecular features with residual GIN, fusing them via LSTM, and modeling interactions with cross-attention, achieving competitive results on five benchmark datasets.