From multi-omics data to the cancer druggable gene discovery: a novel machine learning-based approach

被引:3
|
作者
Yang, Hai [1 ]
Gan, Lipeng [1 ]
Chen, Rui [2 ]
Li, Dongdong [1 ]
Zhang, Jing [1 ]
Wang, Zhe [1 ]
机构
[1] East China Univ Sci & Technol, Dept Comp Sci & Engn, Shanghai, Peoples R China
[2] Vanderbilt Univ, Dept Mol Physiol & Biophys, Nashville, TN USA
关键词
multi-omics; cancer genomics; machine learning; druggable genome; SOMATIC MUTATIONS; ROS1; REARRANGEMENTS; TARGETING ROS1; PREDICTION; LESSONS;
D O I
10.1093/bib/bbac528
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The development of targeted drugs allows precision medicine in cancer treatment and optimal targeted therapies. Accurate identification of cancer druggable genes helps strengthen the understanding of targeted cancer therapy and promotes precise cancer treatment. However, rare cancer-druggable genes have been found due to the multi-omics data's diversity and complexity. This study proposes deep forest for cancer druggable genes discovery (DF-CAGE), a novel machine learning -based method for cancer-druggable gene discovery. DF-CAGE integrated the somatic mutations, copy number variants, DNA methylation and RNA-Seq data across (similar to)10 000 TCGA profiles to identify the landscape of the cancer-druggable genes. We found that DF-CAGE discovers the commonalities of currently known cancerdruggable genes from the perspective of multi-omics data and achieved excellent performance on OncoKB, Target and Drugbank data sets. Among the (similar to)20 000 protein -coding genes, DF-CAGE pinpointed 465 potential cancer-druggable genes. We found that the candidate cancer druggable genes (CDG) are clinically meaningful and divided the CDG into known, reliable and potential gene sets. Finally, we analyzed the omics data's contribution to identifying druggable genes. We found that DF-CAGE reports druggable genes mainly based on the copy number variations (CNVs) data, the gene rearrangements and the mutation rates in the population. These findings may enlighten the future study and development of new drugs.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Machine learning-based analysis of multi-omics data on the cloud for investigating gene regulations
    Oh, Minsik
    Park, Sungjoon
    Kim, Sun
    Chae, Heejoon
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) : 66 - 76
  • [2] Machine learning for multi-omics data integration in cancer
    Cai, Zhaoxiang
    Poulos, Rebecca C.
    Liu, Jia
    Zhong, Qing
    ISCIENCE, 2022, 25 (02)
  • [3] MDMNI-DGD: A novel graph neural network approach for druggable gene discovery based on the integration of multi-omics data and the multi-view network
    Li, Jianwei
    Li, Bing
    Zhang, Xukun
    Ma, Xuxu
    Li, Ziyu
    Computers in Biology and Medicine, 2025, 185
  • [4] InDEP: an interpretable machine learning approach to predict cancer driver genes from multi-omics data
    Yang, Hai
    Liu, Yawen
    Yang, Yijing
    Li, Dongdong
    Wang, Zhe
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [5] Machine learning for the analysis of multi-omics data
    Sun, Yanni
    METHODS, 2021, 189 : 1 - 2
  • [6] A benchmark study of deep learning-based multi-omics data fusion methods for cancer
    Leng, Dongjin
    Zheng, Linyi
    Wen, Yuqi
    Zhang, Yunhao
    Wu, Lianlian
    Wang, Jing
    Wang, Meihong
    Zhang, Zhongnan
    He, Song
    Bo, Xiaochen
    GENOME BIOLOGY, 2022, 23 (01)
  • [7] A benchmark study of deep learning-based multi-omics data fusion methods for cancer
    Dongjin Leng
    Linyi Zheng
    Yuqi Wen
    Yunhao Zhang
    Lianlian Wu
    Jing Wang
    Meihong Wang
    Zhongnan Zhang
    Song He
    Xiaochen Bo
    Genome Biology, 23
  • [8] Deep learning-based ovarian cancer subtypes identification using multi-omics data
    Long-Yi Guo
    Ai-Hua Wu
    Yong-xia Wang
    Li-ping Zhang
    Hua Chai
    Xue-Fang Liang
    BioData Mining, 13
  • [9] Deep learning-based ovarian cancer subtypes identification using multi-omics data
    Guo, Long-Yi
    Wu, Ai-Hua
    Wang, Yong-xia
    Zhang, Li-ping
    Chai, Hua
    Liang, Xue-Fang
    BIODATA MINING, 2020, 13 (01)
  • [10] Comparative Evaluation of Machine Learning Models for Subtyping Triple-Negative Breast Cancer: A Deep Learning-Based Multi-Omics Data Integration Approach
    Yang, Shufang
    Wang, Zihu
    Wang, Changfu
    Li, Changbo
    Wang, Binjie
    JOURNAL OF CANCER, 2024, 15 (12): : 3943 - 3957