Integrating Somatic Mutations for Breast Cancer Survival Prediction Using Machine Learning Methods

Cited by: 9
Authors
He, Zongzhen [1]
Zhang, Junying [1]
Yuan, Xiguo [1]
Zhang, Yuanyuan [2]
Affiliations
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
Keywords
breast cancer; multi-omics; survival prediction; somatic mutation; mRMR; MKL; EXPRESSION; PROGNOSIS
DOI
10.3389/fgene.2020.632901
Chinese Library Classification number
Q3 [Genetics]
Subject classification codes
071007; 090102
Abstract
Breast cancer is the most common malignancy in women and has a high mortality rate, so computational methods that improve the accuracy of breast cancer survival prediction models are urgently needed. Although multi-omics data such as gene expression have been used extensively in recent studies, accurate prognosis of breast cancer remains a challenge. Somatic mutations are another important and promising data source for studying cancer development, and their effect on the prognosis of breast cancer remains to be explored further. Meanwhile, these omics datasets are high-dimensional and redundant. Therefore, we adopted multiple kernel learning (MKL) to efficiently integrate somatic mutation data with current molecular data, including gene expression, copy number variation (CNV), methylation, and protein expression data, for the prediction of breast cancer survival. Before integration, the maximum relevance minimum redundancy (mRMR) feature selection method was applied to each data type to select features with high relevance to survival and low redundancy among themselves. The experimental results demonstrated that the proposed method achieved the best performance and that prediction performance improved markedly when somatic mutations were included, indicating that somatic mutations are critical for improving breast cancer survival prediction. Moreover, mRMR was superior to the other feature selection methods used in previous studies, and MKL outperformed the traditional classifiers in multi-omics data integration. Our analysis indicates that, by employing promising omics data such as somatic mutations together with proper feature selection methods and effective integration frameworks, the accuracy of breast cancer survival prediction can be increased further, thereby supporting better clinical diagnosis and more effective treatment for breast cancer patients.
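The two-stage pipeline described in the abstract (per-data-type mRMR selection, then kernel-based integration) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: absolute Pearson correlation stands in for the mutual-information terms of mRMR, kernel weights come from a simple kernel-target alignment heuristic rather than a full MKL solver, the classifier is kernel ridge regression on labels in {-1, +1}, and the two "omics" views are synthetic continuous data. The names `mrmr_select`, `rbf_kernel`, `alignment_weights`, and `demo` are hypothetical.

```python
import numpy as np

def mrmr_select(X, y, k):
    """Greedy mRMR-style selection (correlation used as a stand-in for
    mutual information): maximise relevance minus mean redundancy."""
    n = len(y)
    Xc = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-12)
    yc = (y - y.mean()) / (y.std() + 1e-12)
    relevance = np.abs(Xc.T @ yc) / n            # |corr(feature, label)|
    selected = [int(np.argmax(relevance))]
    while len(selected) < min(k, X.shape[1]):
        # Mean |corr| between every candidate and the features chosen so far.
        redundancy = np.abs(Xc.T @ Xc[:, selected]).mean(axis=1) / n
        score = relevance - redundancy
        score[selected] = -np.inf                # never pick a feature twice
        selected.append(int(np.argmax(score)))
    return selected

def rbf_kernel(A, B, gamma=None):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    if gamma is None:
        gamma = 1.0 / A.shape[1]
    sq = (A ** 2).sum(1)[:, None] + (B ** 2).sum(1)[None, :] - 2 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def alignment_weights(kernels, y):
    """Heuristic kernel weights from kernel-target alignment (assumption:
    a lightweight substitute for learning the weights with an MKL solver)."""
    target = np.outer(y, y)
    w = np.array([
        max((K * target).sum() / (np.linalg.norm(K) * np.linalg.norm(target)), 0.0)
        for K in kernels
    ])
    if w.sum() == 0:
        return np.full(len(kernels), 1.0 / len(kernels))
    return w / w.sum()

def demo(seed=0):
    rng = np.random.default_rng(seed)
    n = 120
    y = rng.choice([-1.0, 1.0], size=n)          # toy survival class labels
    # Synthetic views; only the first few columns carry signal.
    views = {
        "expression": rng.normal(size=(n, 50)),
        "mutation": rng.normal(size=(n, 30)),
    }
    views["expression"][:, :3] += 1.5 * y[:, None]
    views["mutation"][:, :2] += 1.0 * y[:, None]

    train, test = np.arange(80), np.arange(80, n)
    chosen, K_train, K_test = {}, [], []
    for name, X in views.items():
        # Feature selection uses training samples only.
        cols = mrmr_select(X[train], y[train], k=5)
        chosen[name] = cols
        K_train.append(rbf_kernel(X[train][:, cols], X[train][:, cols]))
        K_test.append(rbf_kernel(X[test][:, cols], X[train][:, cols]))

    w = alignment_weights(K_train, y[train])
    K_tr = sum(wi * K for wi, K in zip(w, K_train))   # combined kernel
    K_te = sum(wi * K for wi, K in zip(w, K_test))
    # Kernel ridge fit on +/-1 labels; the sign of the output is the class.
    alpha = np.linalg.solve(K_tr + 1.0 * np.eye(len(train)), y[train])
    acc = float(np.mean(np.sign(K_te @ alpha) == y[test]))
    return chosen, w, acc
```

Calling `demo()` returns the selected column indices per view, the kernel weights, and the held-out accuracy on the synthetic data. Building one kernel per data type before combining them is what lets each view keep its own feature subset and scale, which is the general idea behind kernel-level integration.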
Pages: 12