Estimating tree-based dynamic treatment regimes using observational data with restricted treatment sequences

被引:2
|
作者
Zhou, Nina [1 ]
Wang, Lu [1 ]
Almirall, Daniel [2 ]
机构
[1] Univ Michigan, Dept Biostat, 1415 Washington Hts M4132 SPH II, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Inst Social Res, Ann Arbor, MI USA
基金
美国国家卫生研究院;
关键词
constrained optimization; dynamic treatment regime; observational studies; tree-based reinforcement learning; viable decision rules; RANDOMIZED-TRIAL; WEIGHT-LOSS; LORCASERIN; CARE;
D O I
10.1111/biom.13754
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A dynamic treatment regime (DTR) is a sequence of decision rules that provide guidance on how to treat individuals based on their static and time-varying status. Existing observational data are often used to generate hypotheses about effective DTRs. A common challenge with observational data, however, is the need for analysts to consider "restrictions" on the treatment sequences. Such restrictions may be necessary for settings where (1) one or more treatment sequences that were offered to individuals when the data were collected are no longer considered viable in practice, (2) specific treatment sequences are no longer available, or (3) the scientific focus of the analysis concerns a specific type of treatment sequences (eg, "stepped-up" treatments). To address this challenge, we propose a restricted tree-based reinforcement learning (RT-RL) method that searches for an interpretable DTR with the maximum expected outcome, given a (set of) user-specified restriction(s), which specifies treatment options (at each stage) that ought not to be considered as part of the estimated tree-based DTR. In simulations, we evaluate the performance of RT-RL versus the standard approach of ignoring the partial data for individuals not following the (set of) restriction(s). The method is illustrated using an observational data set to estimate a two-stage stepped-up DTR for guiding the level of care placement for adolescents with substance use disorder.
引用
收藏
页码:2260 / 2271
页数:12
相关论文
共 50 条
  • [31] Estimating Treatment Effects Using Observational Data and Experimental Data with Non-Overlapping Support
    Han, Kevin
    Wu, Han
    Wu, Linjia
    Shi, Yu
    Liu, Canyao
    ECONOMETRICS, 2024, 12 (03)
  • [32] Integrating randomized and observational studies to estimate optimal dynamic treatment regimes
    Batorsky, Anna
    Anstrom, Kevin J.
    Zeng, Donglin
    BIOMETRICS, 2024, 80 (02)
  • [33] Replica tree-based federated learning using limited data
    Ghilea, Ramona
    Rekik, Islem
    NEURAL NETWORKS, 2025, 186
  • [34] New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes
    Zhao, Ying-Qi
    Zeng, Donglin
    Laber, Eric B.
    Kosorok, Michael R.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (510) : 583 - 598
  • [35] Modeling financial data using clustering and tree-based approaches
    Chen, F
    Figlewski, S
    Weigend, AS
    DATA MINING, 1998, : 35 - 51
  • [36] Estimating the Effect of Early Treatment Initiation in Parkinson's Disease Using Observational Data
    van den Heuvel, Lieneke
    Evers, Luc
    Meinders, Marjan
    Post, Bart
    Stiggelbout, Anne
    Heskes, Tom
    Bloem, Bastiaan
    Krijthe, Jesse
    MOVEMENT DISORDERS, 2021, 36 (02) : 407 - 414
  • [37] Dynamic proxy tree-based data dissemination schemes for wireless sensor networks
    Wensheng Zhang
    Guohong Cao
    Tom La Porta
    Wireless Networks, 2007, 13 : 583 - 595
  • [38] Tree-based subgroup discovery using electronic health record data: heterogeneity of treatment effects for DTG-containing therapies
    Yang, Jiabei
    Mwangi, Ann W.
    Kantor, Rami
    Dahabreh, Issa J.
    Nyambura, Monicah
    Delong, Allison
    Hogan, Joseph W.
    Steingrimsson, Jon A.
    BIOSTATISTICS, 2023, 25 (02) : 323 - 335
  • [39] Dynamic proxy tree-based data dissemination schemes for wireless sensor networks
    Zhang, Wensheng
    Cao, Guohong
    La Porta, Tom
    WIRELESS NETWORKS, 2007, 13 (05) : 583 - 595
  • [40] Evaluation of dynamic tree-based data gathering algorithms for wireless sensor networks
    Moh, M
    Dumont, M
    Moh, TS
    2005 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), Vols 1 and 2, 2005, : 170 - 175