A comparative study of counterfactual estimators

Nicolas Le Roux; Thomas Nedelec; Vianney Perchet

arxiv: 1704.00773 · v3 · pith:LPA4JWRWnew · submitted 2017-04-03 · stat.ML · cs.LG



keywords estimators basic comparative importance sampling average been case


We provide a comparative study of several widely used off-policy estimators (Empirical Average, Basic Importance Sampling and Normalized Importance Sampling), detailing the different regimes where they are individually suboptimal. We then exhibit properties optimal estimators should possess. In the case where examples have been gathered using multiple policies, we show that fused estimators dominate basic ones but can still be improved.
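To make the three estimators named in the abstract concrete, here is a minimal sketch using their standard textbook definitions, which may differ in detail from the paper's own formulation. It assumes each logged example carries a reward, the probability the logging policy gave to the logged action, and the probability the target policy gives to that same action. The function names and the toy data are hypothetical, chosen only for illustration.

```python
def importance_weights(target_probs, logging_probs):
    """Ratio of target-policy to logging-policy probability for each logged action."""
    return [p / q for p, q in zip(target_probs, logging_probs)]


def empirical_average(rewards):
    """Plain mean of the logged rewards; ignores the policy mismatch entirely."""
    return sum(rewards) / len(rewards)


def basic_importance_sampling(rewards, target_probs, logging_probs):
    """Mean of importance-weighted rewards (unbiased under the usual support assumption)."""
    weights = importance_weights(target_probs, logging_probs)
    return sum(w * r for w, r in zip(weights, rewards)) / len(rewards)


def normalized_importance_sampling(rewards, target_probs, logging_probs):
    """Importance-weighted rewards divided by the sum of the weights (self-normalized)."""
    weights = importance_weights(target_probs, logging_probs)
    return sum(w * r for w, r in zip(weights, rewards)) / sum(weights)


# Toy logged data (made up for illustration, not taken from the paper).
rewards = [1.0, 0.0, 1.0, 1.0]
logging_probs = [0.5, 0.5, 0.5, 0.5]
target_probs = [0.8, 0.2, 0.8, 0.8]

print("empirical average:", empirical_average(rewards))
print("basic IS:", basic_importance_sampling(rewards, target_probs, logging_probs))
print("normalized IS:", normalized_importance_sampling(rewards, target_probs, logging_probs))
```

On this toy data the basic importance sampling estimate falls outside the range of the observed rewards, whereas the normalized variant, being a weighted average of the rewards, cannot. This is one simple illustration of how the estimators can behave differently on the same sample.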
