pith. sign in

A closer look at invalid action masking in policy gradient algorithms

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 1 method 1

citation-polarity summary

years

2026 5

clear filters

representative citing papers

AlphaTransit: Learning to Design City-scale Transit Routes

cs.AI · 2026-05-27 · unverdicted · novelty 6.0

AlphaTransit pairs MCTS with a learned policy-value network to reach 54.6% and 82.1% service rates on a Bloomington transit benchmark, outperforming plain RL and plain MCTS baselines.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • AlphaTransit: Learning to Design City-scale Transit Routes cs.AI · 2026-05-27 · unverdicted · none · ref 26

    AlphaTransit pairs MCTS with a learned policy-value network to reach 54.6% and 82.1% service rates on a Bloomington transit benchmark, outperforming plain RL and plain MCTS baselines.