pith. sign in

hub Canonical reference

arXiv preprint arXiv:2205.10330 (2022)

Canonical reference. 100% of citing Pith papers cite this work as background.

12 Pith papers citing it
Background 100% of classified citations

hub tools

citation-role summary

background 5

citation-polarity summary

years

2026 7 2025 5

verdicts

UNVERDICTED 12

roles

background 5

polarities

background 5

clear filters

representative citing papers

Regularized Reward-Punishment Reinforcement Learning

cs.LG · 2026-06-26 · unverdicted · novelty 5.0

Introduces KCPR and its deep form klDMP that couples reward and punishment policies via learned priors, yielding improved safety and stability in grid-world and Gazebo navigation tasks over DQN, SQL and softDMP.

Safe and Policy-Compliant Multi-Agent Orchestration for Enterprise AI

cs.AI · 2026-04-19 · unverdicted · novelty 5.0

CAMCO enforces policy constraints on multi-agent AI at deployment time via convex projection, risk-weighted Lagrangian shaping, and bounded-convergence negotiation, yielding zero violations and 92-97% utility in tested enterprise scenarios.

citing papers explorer

Showing 1 of 1 citing paper after filters.