pith. sign in

Conservative safety critics for exploration

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

fields

cs.LG 2 cs.CV 1

years

2026 3

verdicts

UNVERDICTED 3

representative citing papers

An Agency-Transferring Model-Free Policy Enhancement Technique

cs.LG · 2026-06-08 · unverdicted · novelty 5.0

A model-free RL method arbitrates between a functional baseline policy and a learning policy, transferring agency over time to yield a standalone policy with high goal-reaching rates and competitive returns on continuous-control tasks.

Safe-Support Q-Learning: Learning without Unsafe Exploration

cs.LG · 2026-04-28 · unverdicted · novelty 5.0

Safe-Support Q-Learning trains Q-functions and policies in reinforcement learning without ever visiting unsafe states by constraining the behavior policy to a safe set and using KL-regularized Bellman targets in a two-stage framework.

citing papers explorer

Showing 3 of 3 citing papers.