Decision tree based predictive models for breast cancer survivability on imbalanced data

被引:0
|
作者
Liu Ya-Qin [1 ]
Wang Cheng [1 ]
Zhang Lu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Biomed Engn, Sch Basic Med, Shanghai 200030, Peoples R China
关键词
imbalanced data; decision tree; predictive breast cancer survivability; 10-fold stratified cross-validation; bagging algorithm;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Based on imbalanced data, the predictive models for 5-year survivability of breast cancer using decision tree are proposed. After data preprocessing from SEER breast cancer datasets, it is obviously that the category of data distribution is imbalanced. Under-sampling is taken to make up the disadvantage of the performance of models caused by the imbalanced data. The performance of the models is evaluated by AUC under ROC curve, accuracy, specificity and sensitivity with 10-fold stratified cross-validation. The performance of models is best while the distribution of data is approximately equal. Bagging algorithm is used to build an integration decision tree model for predicting breast cancer survivability.
引用
收藏
页码:312 / 315
页数:4
相关论文
共 50 条
  • [21] Decision tree and artificial immune systems for stroke prediction in imbalanced data
    Santos, Laercio Ives
    Camargos, Murilo Osorio
    Silveira Vasconcelos D'Angelo, Marcos Flavio
    Mendes, Joao Batista
    Camargos de Medeiros, Egydio Emiliano
    Sena Guimaraes, Andre Luiz
    Palhares, Reinaldo Martinez
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 191
  • [22] Decision tree and artificial immune systems for stroke prediction in imbalanced data
    Santos, Laércio Ives
    Camargos, Murilo Osorio
    D'Angelo, Marcos Flávio Silveira Vasconcelos
    Mendes, João Batista
    Medeiros, Egydio Emiliano Camargos de
    Guimarães, André Luiz Sena
    Palhares, Reinaldo Martínez
    [J]. Expert Systems with Applications, 2022, 191
  • [23] Breast cancer data analysis for survivability studies and prediction
    Shukla, Nagesh
    Hagenbuchner, Markus
    Win, Khin Than
    Yang, Jack
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2018, 155 : 199 - 208
  • [24] Data mining models for predicting oral cancer survivability
    Sharma N.
    Om H.
    [J]. Network Modeling Analysis in Health Informatics and Bioinformatics, 2013, 2 (4) : 285 - 295
  • [25] Identification of the Treatment Survivability Gene Biomarkers of Breast Cancer Patients via a Tree-Based Approach
    Abou Tabl, Ashraf
    Alkhateeb, Abedalrhman
    Rueda, Luis
    ElMaraghy, Waguih
    Ngom, Alioune
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2018, PT I, 2018, 10813 : 166 - 176
  • [26] A new imbalanced learning and dictions tree method for breast cancer diagnosis
    Parvin, Hamid
    Minaei-Bidgoli, Behrouz
    Alinejad-Rokny, Hamid
    [J]. Journal of Bionanoscience, 2013, 7 (06): : 673 - 678
  • [27] A Data Integration Approach for Detecting Biomarkers of Breast Cancer Survivability
    Huy Quang Pham
    Rueda, Luis
    Ngom, Alioune
    [J]. BIOINFORMATICS AND BIOMEDICAL ENGINEERING (IWBBIO 2020), 2020, 12108 : 49 - 60
  • [28] Fuzzy Decision Tree for Breast Cancer Prediction
    Domingo, Mylene J.
    Gerardo, Bobby D.
    Medina, Ruji P.
    [J]. PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION SCIENCE AND SYSTEM, AISS 2019, 2019,
  • [29] Developing a utility decision framework to evaluate predictive models in breast cancer risk estimation
    Wu, Yirong
    Abbey, Craig K.
    Chen, Xianqiao
    Liu, Jie
    Page, David C.
    Alagoz, Oguzhan
    Peissig, Peggy
    Onitilo, Adedayo A.
    Burnside, Elizabeth S.
    [J]. JOURNAL OF MEDICAL IMAGING, 2015, 2 (04)
  • [30] Building Decision Rules for Decision-Making: A Study of Thai Breast Cancer Survivability Prediction
    Thongkam, Jaree
    Sukmak, Vatinee
    Xu, Guandong
    Zhang, Yanchun
    [J]. 2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL V, 2010, : 213 - 216