Multiple Imputation in Survival Models: Applied on Breast Cancer Data

被引:0
|
作者
Baneshi, M. R. [1 ]
Talei, A. R. [2 ]
机构
[1] Kerman Univ Med Sci, Dept Biostat & Epidemiol, Kerman, Iran
[2] Shiraz Univ Med Sci, Dept Surg, Shiraz, Iran
关键词
Prognostic model; Missing data; Multiple imputation; Breast cancer; MISSING PREDICTOR VALUES; RISK;
D O I
暂无
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Missing data is a common problem in cancer research. While simple methods such as complete-case (C-C) analysis are commonly employed for handling this problem, several studies have shown that these methods led to biased estimates. We aim to address the methodological issues in development of a prognostic model with missing data. Methods: Three hundred and ten breast cancer patients were enrolled. At first, patients with missing data on any of four candidate variables were omitted. Secondly, missing data were imputed 10 times. Cox regression model was fitted to the C-C and imputed data. Results were compared in terms of variables retained in the model, discrimination ability, and goodness of fit. Results: Some variables lost their effect in complete-case analysis, due to loss in power, but reached significance level after imputation of missing data. Discrimination ability and goodness of fit of imputed data sets model was higher than that of complete-case model (C-index 76% versus 72%; Likelihood Ratio Test 51.19 versus 32.44). Conclusion: Our findings showed inappropriateness of ad hoc complete-case analysis. This approach led to loss in power and imprecise estimates. Application of multiple imputation techniques to avid such problems is recommended.
引用
收藏
页码:544 / 549
页数:6
相关论文
共 50 条
  • [11] COX REGRESSION MODELS WITH NONPROPORTIONAL HAZARDS APPLIED TO LUNG CANCER SURVIVAL DATA
    Ata, Nihal
    Sozer, M. Tekin
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2007, 36 (02): : 157 - 167
  • [12] Missing data imputation in longitudinal cohort studies - application of PLANN-ARD in breast cancer survival
    Fernandes, Ana S.
    Jarman, Ian H.
    Etchells, Terence A.
    Fonseca, Jose M.
    Biganzoli, Ella
    Bajdik, Chris
    Lisboa, Paulo J. G.
    SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 644 - +
  • [13] A multiple imputation approach for clustered interval-censored survival data
    Lam, K. F.
    Xu, Ying
    Cheung, Tak-Lun
    STATISTICS IN MEDICINE, 2010, 29 (06) : 680 - 693
  • [14] Missing data in association studies: a multiple imputation approach applied to case/control and family data
    Croiseau, P
    Genin, E
    Cordell, HJ
    GENETIC EPIDEMIOLOGY, 2005, 29 (03) : 241 - 241
  • [15] Survival Models in Breast Cancer Patients
    Rajaeefard, A. R.
    Baneshi, M. R.
    Talei, A. R.
    Mehrabani, D.
    IRANIAN RED CRESCENT MEDICAL JOURNAL, 2009, 11 (03) : 295 - 300
  • [16] Multiple imputation of incomplete multilevel data using Heckman selection models
    Munoz, Johanna
    Efthimiou, Orestis
    Audigier, Vincent
    de Jong, Valentijn M. T.
    Debray, Thomas P. A.
    STATISTICS IN MEDICINE, 2024, 43 (03) : 514 - 533
  • [17] Missing data imputation on the 5-year survival prediction of breast cancer patients with unknown discrete values
    Garcia-Laencina, Pedro J.
    Abreu, Pedro Henriques
    Abreu, Miguel Henriques
    Afonoso, Noemia
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 59 : 125 - 133
  • [18] Variable selection for additive models with missing data via multiple imputation
    Yuta Shimazu
    Takayuki Yamaguchi
    Ibuki A. J. Hoshina
    Hidetoshi Matsui
    Behaviormetrika, 2025, 52 (1) : 163 - 178
  • [19] Multivariate Parametric Spatiotemporal Models for County Level Breast Cancer Survival Data
    Xiaoping Jin
    Bradley P. Carlin
    Lifetime Data Analysis, 2005, 11 : 5 - 27
  • [20] Multivariate parametric spatiotemporal models for county level breast cancer survival data
    Jin, XP
    Carlin, BP
    LIFETIME DATA ANALYSIS, 2005, 11 (01) : 5 - 27