Deep Learning-based Propensity Scores for Confounding Control in Comparative Effectiveness Research A Large-scale, Real-world Data Study

被引:12
|
作者
Weberpals, Janick [1 ]
Becker, Tim [2 ,3 ]
Davies, Jessica [4 ]
Schmich, Fabian [1 ]
Ruettinger, Dominik [5 ]
Theis, Fabian J. [6 ,7 ]
Bauer-Mehren, Anna [1 ]
机构
[1] Roche Innovat Ctr Munich RICM, Pharmaceut Res & Early Dev Informat pREDi, Data Sci, Penzberg, Germany
[2] xValue GmbH, Willich, Germany
[3] Roche Innovat Ctr Munich RICM, Pharmaceut Res & Early Dev Informat pREDi, Data Sci 4, Penzberg, Germany
[4] F Hoffmann La Roche Ltd, Welwyn Garden City, Herts, England
[5] Roche Innovat Ctr Munich RICM, Pharmaceut Res & Early Dev pRED, Early Clin Dev Oncol, Penzberg, Germany
[6] Helmholtz Ctr Munich, German Res Ctr Environm Hlth, Inst Computat Biol, Neuherberg, Germany
[7] Tech Univ Munich, Dept Math, Garching, Germany
关键词
Autoencoder; Causal inference; Comparative effectiveness research; Deep learning; Electronic health records; Machine learning; Propensity scores; VARIABLE SELECTION; PROGNOSTIC SCORE; BIAS; SIMULATION; ADJUSTMENT; PATIENT; FUTURE; TIME;
D O I
10.1097/EDE.0000000000001338
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Background: Due to the non-randomized nature of real-world data, prognostic factors need to be balanced, which is often done by propensity scores (PSs). This study aimed to investigate whether autoencoders, which are unsupervised deep learning architectures, might be leveraged to compute PS. Methods: We selected patient-level data of 128,368 first-line treated cancer patients from the Flatiron Health EHR-derived de-identified database. We trained an autoencoder architecture to learn a lower-dimensional patient representation, which we used to compute PS. To compare the performance of an autoencoder-based PS with established methods, we performed a simulation study. We assessed the balancing and adjustment performance using standardized mean differences, root mean square errors (RMSE), percent bias, and confidence interval coverage. To illustrate the application of the autoencoder-based PS, we emulated the PRONOUNCE trial by applying the trial's protocol elements within an observational database setting, comparing two chemotherapy regimens. Results: All methods but the manual variable selection approach led to well-balanced cohorts with average standardized mean differences <0.1. LASSO yielded on average the lowest deviation of resulting estimates (RMSE 0.0205) followed by the autoencoder approach (RMSE 0.0248). Altering the hyperparameter setup in sensitivity analysis, the autoencoder approach led to similar results as LASSO (RMSE 0.0203 and 0.0205, respectively). In the case study, all methods provided a similar conclusion with point estimates clustered around the null (e.g., HRautoencoder 1.01 [95% confidence interval = 0.80, 1.27] vs. HRPRONOUNCE 1.07 [0.83, 1.36]). Conclusions: Autoencoder-based PS computation was a feasible approach to control for confounding but did not perform better than some established approaches like LASSO.
引用
收藏
页码:378 / 388
页数:11
相关论文
共 50 条
  • [1] Deep learning-based propensity scores for confounding control in comparative effectiveness research: A large-scale real-world data study
    Weberpals, Janick
    Becker, Tim
    Davies, Jessica
    Schmich, Fabian
    Ruettinger, Dominik
    Theis, Fabian
    Bauer-Mehren, Anna
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2020, 29 : 382 - 383
  • [2] Large-scale real-world radio signal recognition with deep learning
    Ya TU
    Yun LIN
    Haoran ZHA
    Ju ZHANG
    Yu WANG
    Guan GUI
    Shiwen MAO
    [J]. Chinese Journal of Aeronautics, 2022, 35 (09) : 35 - 48
  • [3] Large-scale real-world radio signal recognition with deep learning
    Ya TU
    Yun LIN
    Haoran ZHA
    Ju ZHANG
    Yu WANG
    Guan GUI
    Shiwen MAO
    [J]. Chinese Journal of Aeronautics, 2022, (09) - 48
  • [4] Large-scale real-world radio signal recognition with deep learning
    Tu, Ya
    Lin, Yun
    Zha, Haoran
    Zhang, Ju
    Wang, Yu
    Gui, Guan
    Mao, Shiwen
    [J]. CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (09) : 35 - 48
  • [5] Evaluating large-scale propensity score performance through real-world and synthetic data experiments
    Tian, Yuxi
    Schuemie, Martijn J.
    Suchard, Marc A.
    [J]. INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2018, 47 (06) : 2005 - 2014
  • [6] Deep Learning-Based Sentimental Analysis for Large-Scale Imbalanced Twitter Data
    Jamal, Nasir
    Chen, Xianqiao
    Aldabbas, Hamza
    [J]. FUTURE INTERNET, 2019, 11 (09)
  • [7] Real-World Effectiveness of Lung Cancer Screening Using Deep Learning-Based Counterfactual Prediction
    Feng, Zheng
    Chen, Zhaoyi
    Guo, Yi
    Prosperi, Mattia
    Mehta, Hiren
    Braithwaite, Dejana
    Wu, Yonghui
    Bian, Jiang
    [J]. MEDINFO 2023 - THE FUTURE IS ACCESSIBLE, 2024, 310 : 419 - 423
  • [8] Prevalence and Comorbidity of Atopic Dermatitis in Children: A Large-Scale Population Study Based on Real-World Data
    Gilaberte, Yolanda
    Blas Perez-Gilaberte, Juan
    Poblador-Plou, Beatriz
    Bliek-Bueno, Kevin
    Gimeno-Miguel, Antonio
    Prados-Torres, Alexandra
    [J]. JOURNAL OF CLINICAL MEDICINE, 2020, 9 (06)
  • [9] A large-scale real-world study for colorectal cancer screening
    Liu, S.
    Wang, Y.
    Zhuo, D.
    Peng, J.
    Jiang, R.
    Duan, C.
    Liu, F.
    Zhang, H.
    Tian, X.
    Ding, X.
    Zhang, M.
    Cao, D.
    Liu, Y.
    Wang, Y.
    Shi, Z.
    [J]. ANNALS OF ONCOLOGY, 2023, 34 : S418 - S418
  • [10] How Symmetric Are Real-World Graphs? A Large-Scale Study
    Ball, Fabian
    Geyer-Schulz, Andreas
    [J]. SYMMETRY-BASEL, 2018, 10 (01):