Integrating Somatic Mutations for Breast Cancer Survival Prediction Using Machine Learning Methods

Cited by: 9
Authors
He, Zongzhen [1]
Zhang, Junying [1]
Yuan, Xiguo [1]
Zhang, Yuanyuan [2]
Affiliations
[1] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[2] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
Keywords
breast cancer; multi-omics; survival prediction; somatic mutation; mRMR; MKL; EXPRESSION; PROGNOSIS;
DOI
10.3389/fgene.2020.632901
Chinese Library Classification
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Breast cancer is the most common malignancy in women, and because of its high mortality rate, computational methods that improve the accuracy of breast cancer survival prediction are urgently needed. Although multi-omics data such as gene expression have been used extensively in recent studies, accurate prognosis of breast cancer remains a challenge. Somatic mutations are another important and promising data source for studying cancer development, and their effect on breast cancer prognosis remains to be further explored. Meanwhile, these omics datasets are high-dimensional and redundant. Therefore, we adopted multiple kernel learning (MKL) to efficiently integrate somatic mutation data with existing molecular data, including gene expression, copy number variation (CNV), methylation, and protein expression data, for the prediction of breast cancer survival. Before integration, the maximum relevance minimum redundancy (mRMR) feature selection method was applied to each data type to select features that are highly relevant to survival and minimally redundant with one another. The experimental results demonstrated that the proposed method achieved the best performance and that prediction performance improved markedly when somatic mutations were included, indicating that somatic mutations are critical for improving breast cancer survival prediction. Moreover, mRMR was superior to the other feature selection methods used in previous studies, and MKL outperformed traditional classifiers in multi-omics data integration. Our analysis indicated that by employing promising omics data such as somatic mutations and combining proper feature selection methods with effective integration frameworks, the accuracy of breast cancer survival prediction can be further increased, thereby supporting better clinical diagnosis and more effective treatment for breast cancer patients.
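The two steps named in the abstract, mRMR feature selection per data type followed by kernel-based integration, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes absolute Pearson correlation as a stand-in for mutual information, a linear kernel per data type, and fixed kernel weights, whereas an actual MKL method would learn the weights together with the classifier. The function names (`pearson`, `mrmr_select`, `linear_kernel`, `combine_kernels`) and the toy data are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def mrmr_select(columns, labels, k):
    """Greedy mRMR: repeatedly pick the feature with the highest
    relevance to the labels minus mean redundancy with features already chosen.

    Assumption: |Pearson r| replaces mutual information for both terms.
    """
    relevance = [abs(pearson(col, labels)) for col in columns]
    selected = []
    remaining = list(range(len(columns)))

    def score(j):
        if not selected:
            return relevance[j]
        redundancy = sum(abs(pearson(columns[j], columns[s])) for s in selected)
        return relevance[j] - redundancy / len(selected)

    while remaining and len(selected) < k:
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


def linear_kernel(rows):
    """Gram matrix of dot products between samples (one row per sample)."""
    return [[sum(a * b for a, b in zip(r1, r2)) for r2 in rows] for r1 in rows]


def combine_kernels(kernels, weights):
    """Weighted sum of per-data-type Gram matrices: K = sum_m w_m * K_m.

    Assumption: weights are supplied by hand; real MKL would learn them.
    """
    n = len(kernels[0])
    return [
        [sum(w * K[i][j] for w, K in zip(weights, kernels)) for j in range(n)]
        for i in range(n)
    ]


# Toy data: 8 samples, binary survival label, 3 features stored column-wise.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = [
    [0, 0, 0, 1, 1, 1, 1, 1],  # feature 0: most relevant
    [0, 0, 1, 1, 1, 1, 1, 1],  # feature 1: relevant but largely redundant with 0
    [0, 1, 0, 0, 1, 1, 0, 1],  # feature 2: less relevant, little overlap with 0
]
selected = mrmr_select(features, labels, k=2)
print(selected)  # [0, 2]

# Two toy "omics" views of the same 2 samples, combined with equal weights.
K_a = linear_kernel([[1, 0], [0, 1]])
K_b = linear_kernel([[2], [2]])
combined = combine_kernels([K_a, K_b], weights=[0.5, 0.5])
```

In this toy case, ranking by relevance alone would keep features 0 and 1, while the redundancy penalty leads the greedy step to prefer feature 2 as the second pick. The combined Gram matrix could then be passed to any classifier that accepts a precomputed kernel.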
Pages: 12