CA-PG reduces variance in Plackett-Luce ESR training by computing gradients on marginal item-inclusion probabilities rather than joint candidate-set probabilities.
Fast slate policy optimization: Going beyond plackett-luce
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.IR 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Credit-assigned Policy Gradient for Early Stage Retrieval in Two-stage Ranking
CA-PG reduces variance in Plackett-Luce ESR training by computing gradients on marginal item-inclusion probabilities rather than joint candidate-set probabilities.