Breast Cancer Recurrence Risk Predictor Using a Deep Learning Multi-omics Data Integration Framework

被引:0
|
作者
Rahman, Ariana [1 ]
Zhang, Yining [1 ]
Park, Jin G. [1 ]
机构
[1] Arizona State Univ, Biodesign Inst, Ctr Personalized Diagnost, Tempe, AZ 85287 USA
关键词
breast cancer; recurrence; deep learning; multiomics; autoencoder; feature extraction; random forests;
D O I
10.1109/CBMS58004.2023.00343
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last few decades, many breast cancer patients have become disease-free due to the advancements in detection, diagnosis, and treatment. However, cancer can come back for almost 30% of patients who are disease free after initial treatment. Current methods of recurrence risk prediction use clinical or phenotypic characteristics which may not give a complete understanding of the underlying genetic causes of the recurrence. Although there are some gene profiling available for recurrence prediction such as Oncotype DX, they are limited to estrogen receptor-positive tumors. The aim of this study was to utilize multi-omics data of gene expression, copy number variation, mutation, and microRNA, in the recurrence rate prediction of any breast cancer tumor. For this project, a modified deep learning-based autoencoder structure known as AIME was used to extract accurate integrated embeddings. Unlike a typical autoencoder, where the input and output layers are the same, AIME uses two different data types such as gene expression and copy number variation, and, at the same time, adjusts for the confounding factors which in this case, was the estrogen receptor status. The integrated embedding data from AIME was then applied to a supervised machine-learning approach of Random Forest algorithm to classify samples into disease-free and tumor-regressed categories. As part of the AIME output, a list of the 25 most significant genes for breast cancer recurrence was identified which could be effectively utilized both as biomarkers to improve patient diagnostics and develop targeted therapies, and as molecular signature to investigate mechanisms of tumor progression and recursion. This approach can be expanded to include other cancer types to predict and understand tumor recursion.
引用
收藏
页码:921 / 922
页数:2
相关论文
共 50 条
  • [31] Improving survival prediction using flexible late fusion machine learning framework for multi-omics data integration
    Nikolaou, Nikos
    Salazar, Domingo
    RaviPrakash, Harish
    Goncalves, Miguel
    Argoty, Gustavo Alonso Arango
    Burlutsky, Nikolay
    Markuzon, Natasha
    Jacob, Etai
    [J]. CANCER RESEARCH, 2023, 83 (07)
  • [32] Integration strategies of multi-omics data for machine learning analysis
    Picard, Milan
    Scott-Boyer, Marie -Pier
    Bodein, Antoine
    Perin, Olivier
    Droit, Arnaud
    [J]. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 3735 - 3746
  • [33] Integration of Multi-Omics Data for Gene Regulatory Network Inference and Application to Breast Cancer
    Yuan, Lin
    Guo, Le-Hang
    Yuan, Chang-An
    Zhang, Youhua
    Han, Kyungsook
    Nandi, Asoke K.
    Honig, Barry
    Huang, De-Shuang
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 782 - 791
  • [34] Deep-Learning Algorithm and Concomitant Biomarker Identification for NSCLC Prediction Using Multi-Omics Data Integration
    Park, Min-Koo
    Lim, Jin-Muk
    Jeong, Jinwoo
    Jang, Yeongjae
    Lee, Ji-Won
    Lee, Jeong-Chan
    Kim, Hyungyu
    Koh, Euiyul
    Hwang, Sung-Joo
    Kim, Hong-Gee
    Kim, Keun-Cheol
    [J]. BIOMOLECULES, 2022, 12 (12)
  • [35] Classifying breast cancer subtypes on multi-omics data via sparse canonical correlation analysis and deep learning
    Huang, Yiran
    Zeng, Pingfan
    Zhong, Cheng
    [J]. BMC BIOINFORMATICS, 2024, 25 (01)
  • [36] Classifying breast cancer subtypes on multi-omics data via sparse canonical correlation analysis and deep learning
    Yiran Huang
    Pingfan Zeng
    Cheng Zhong
    [J]. BMC Bioinformatics, 25
  • [37] Multi-omics integration method based on attention deep learning network for biomedical data classification
    Gong, Ping
    Cheng, Lei
    Zhang, Zhiyuan
    Meng, Ao
    Li, Enshuo
    Chen, Jie
    Zhang, Longzhen
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 231
  • [38] Multi-omics data integration reveals metabolome as the top predictor of the cervicovaginal microenvironment
    Bokulich, Nicholas A.
    Laniewski, Pawel
    Adamov, Anja
    Chase, Dana M.
    Caporaso, J. Gregory
    Herbst-Kralovetz, Melissa M.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (02)
  • [39] DeepDRK: a deep learning framework for drug repurposing through kernel-based multi-omics integration
    Wang, Yongcui
    Yang, Yingxi
    Chen, Shilong
    Wang, Jiguang
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [40] Deep Learning-Based Multi-Omics Integration Robustly Predicts Relapse in Prostate Cancer
    Wei, Ziwei
    Han, Dunsheng
    Zhang, Cong
    Wang, Shiyu
    Liu, Jinke
    Chao, Fan
    Song, Zhenyu
    Chen, Gang
    [J]. FRONTIERS IN ONCOLOGY, 2022, 12