pith. machine review for the scientific record.

Jonathan Ho and Stefano Ermon

3 Pith papers cite this work. Polarity classification is still indexing.

Citing papers by year: 2026 (2), 2019 (1)

representative citing papers

ANO: A Principled Approach to Robust Policy Optimization

cs.AI · 2026-05-04 · unverdicted · novelty 6.0

ANO derives a robust policy optimizer from geometric principles that replaces clipping with a smooth redescending gradient, showing better performance and stability than PPO, SPO, and GRPO in MuJoCo, Atari, and RLHF experiments.

Remote Action Generation: Remote Control with Minimal Communication

cs.IT · 2026-05-03 · unverdicted · novelty 6.0

GRASP reduces communication in remote control by 12-fold on average (50-fold for continuous actions) by having actors generate actions via guided sampling and local policy learning instead of receiving full actions or rewards.
