$Q$- and $A$-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

Anastasios A. Tsiatis; Eric B. Laber; Marie Davidian; Phillip J. Schulte

arxiv: 1202.4177 · v3 · pith:QAHIQCJYnew · submitted 2012-02-19 · 📊 stat.ME · cs.AI

Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

Phillip J. Schulte , Anastasios A. Tsiatis , Eric B. Laber , Marie Davidian This is my paper

classification 📊 stat.ME cs.AI

keywords treatmentdatadecisiondynamicestimatingmethodsoptimalpatient

0 comments

read the original abstract

In clinical practice, physicians make a series of treatment decisions over the course of a patient's disease based on his/her baseline and evolving characteristics. A dynamic treatment regime is a set of sequential decision rules that operationalizes this process. Each rule corresponds to a decision point and dictates the next treatment action based on the accrued information. Using existing data, a key goal is estimating the optimal regime, that, if followed by the patient population, would yield the most favorable outcome on average. Q- and A-learning are two main approaches for this purpose. We provide a detailed account of these methods, study their performance, and illustrate them using data from a depression study.

This paper has not been read by Pith yet.

Q- and A-Learning Methods for Estimating Optimal Dynamic Treatment Regimes

discussion (0)