A closer look at invalid action masking in policy gradient algorithms

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1 · method 1

citation-polarity summary

years

2026 7

verdicts

UNVERDICTED 7

representative citing papers

AlphaTransit: Learning to Design City-scale Transit Routes

cs.AI · 2026-05-27 · unverdicted · novelty 6.0

AlphaTransit pairs MCTS with a learned policy-value network to reach 54.6% and 82.1% service rates on a Bloomington transit benchmark, outperforming plain RL and plain MCTS baselines.
