Semiparametric analysis of clustered interval-censored survival data using soft Bayesian additive regression trees (SBART)

Cited by: 13
Authors
Basak, Piyali [1]
Linero, Antonio [2]
Sinha, Debajyoti [1]
Lipsitz, Stuart [3]
Affiliations
[1] Florida State Univ, Tallahassee, FL 32306 USA
[2] Univ Texas Austin, Austin, TX 78712 USA
[3] Brigham & Womens Hosp, Boston, MA 02115 USA
Funding
US National Science Foundation;
Keywords
Bayesian additive regression trees; machine learning; nonproportional hazards; semiparametric; survival analysis; MODELS;
DOI
10.1111/biom.13478
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Popular parametric and semiparametric hazards regression models for clustered survival data are inappropriate and inadequate when the unknown effects of different covariates and clustering are complex. This calls for a flexible modeling framework to yield efficient survival prediction. Moreover, for some survival studies involving time to occurrence of some asymptomatic events, survival times are typically interval censored between consecutive clinical inspections. In this article, we propose a robust semiparametric model for clustered interval-censored survival data under a paradigm of Bayesian ensemble learning, called soft Bayesian additive regression trees or SBART (Linero and Yang, 2018), which combines multiple sparse (soft) decision trees to attain excellent predictive accuracy. We develop a novel semiparametric hazards regression model by modeling the hazard function as a product of a parametric baseline hazard function and a nonparametric component that uses SBART to incorporate clustering, unknown functional forms of the main effects, and interaction effects of various covariates. In addition to being applicable for left-censored, right-censored, and interval-censored survival data, our methodology is implemented using a data augmentation scheme which allows for existing Bayesian backfitting algorithms to be used. We illustrate the practical implementation and advantages of our method via simulation studies and an analysis of a prostate cancer surgery study where dependence on the experience and skill level of the physicians leads to clustering of survival times. We conclude by discussing our method's applicability in studies involving high-dimensional data with complex underlying associations.
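The multiplicative hazard structure described in the abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the Weibull form of the baseline hazard, the exponential link exp(r(x)), the default parameter values, and the function names. The covariate term r_x is a plain number standing in for the nonparametric component that the paper models with SBART. Under these assumptions, a survival time known only to lie in (left, right] contributes S(left | x) - S(right | x) to the likelihood, with left = 0 for left censoring and right = infinity for right censoring.

```python
import math


def weibull_cum_hazard(t, shape, scale):
    """Cumulative baseline hazard H0(t) = (t / scale) ** shape (assumed Weibull form)."""
    if math.isinf(t):
        return math.inf
    return (t / scale) ** shape


def survival(t, r_x, shape=1.5, scale=10.0):
    """S(t | x) = exp(-H0(t) * exp(r_x)) for the hazard h0(t) * exp(r_x).

    r_x is a hypothetical placeholder for the fitted nonparametric term.
    """
    return math.exp(-weibull_cum_hazard(t, shape, scale) * math.exp(r_x))


def interval_likelihood(left, right, r_x, shape=1.5, scale=10.0):
    """Likelihood contribution P(left < T <= right | x) = S(left) - S(right).

    left = 0 gives a left-censored observation and right = math.inf gives
    a right-censored one.
    """
    return survival(left, r_x, shape, scale) - survival(right, r_x, shape, scale)


if __name__ == "__main__":
    # Event detected between inspections at times 2 and 5 (illustrative values).
    print(interval_likelihood(2.0, 5.0, r_x=0.3))
    # Still event-free at the last inspection at time 5 (right censored).
    print(interval_likelihood(5.0, math.inf, r_x=0.3))
```

Treating all three censoring types as special cases of one interval keeps the likelihood code in a single function, which mirrors the abstract's statement that the method applies to left-censored, right-censored, and interval-censored data.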
Pages: 880 - 893
Page count: 14
Related papers
50 in total
  • [1] Semiparametric analysis of clustered interval-censored survival data with a cure fraction
    Lam, K. F.
    Wong, Kin-Yau
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 79 : 165 - 174
  • [2] Semiparametric regression analysis of interval-censored data
    Goetghebeur, E
    Ryan, L
    [J]. BIOMETRICS, 2000, 56 (04) : 1139 - 1144
  • [3] Bayesian analysis of clustered interval-censored data
    Wong, MCM
    Lam, KF
    Lo, ECM
    [J]. JOURNAL OF DENTAL RESEARCH, 2005, 84 (09) : 817 - 821
  • [4] Semiparametric regression analysis of clustered interval-censored failure time data with a cured subgroup
    Yang, Dian
    Du, Mingyue
    Sun, Jianguo
    [J]. STATISTICS IN MEDICINE, 2021, 40 (30) : 6918 - 6930
  • [5] Semiparametric efficient estimation for additive hazards regression with case II interval-censored survival data
    He, Baihua
    Liu, Yanyan
    Wu, Yuanshan
    Zhao, Xingqiu
    [J]. LIFETIME DATA ANALYSIS, 2020, 26 (04) : 708 - 730
  • [6] A Semiparametric Regression Method for Interval-Censored Data
    Han, Seungbong
    Andrei, Adin-Cristian
    Tsui, Kam-Wah
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2014, 43 (01) : 18 - 30
  • [7] Semiparametric efficient estimation for additive hazards regression with case II interval-censored survival data
    Baihua He
    Yanyan Liu
    Yuanshan Wu
    Xingqiu Zhao
    [J]. Lifetime Data Analysis, 2020, 26 : 708 - 730
  • [8] Semiparametric Regression Analysis of Clustered Interval-Censored Failure Time Data with Informative Cluster Size
    Zhang, Xinyan
    Sun, Jianguo
    [J]. INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2013, 9 (02) : 205 - 214
  • [9] Bayesian semiparametric model for spatially correlated interval-censored survival data
    Pan, Chun
    Cai, Bo
    Wang, Lianming
    Lin, Xiaoyan
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 74 : 198 - 208
  • [10] Semiparametric regression analysis of interval-censored data with informative dropout
    Gao, Fei
    Zeng, Donglin
    Lin, Dan-Yu
    [J]. BIOMETRICS, 2018, 74 (04) : 1213 - 1222