LOTUS: A single- and multitask machine learning algorithm for the prediction of cancer driver genes

被引:30
|
作者
Collier, Olivier [1 ]
Stoven, Veronique [2 ,3 ,4 ]
Vert, Jean-Philippe [2 ,5 ]
机构
[1] Univ Paris Nanterre, ModalX, UPL, F-92000 Nanterre, France
[2] PSL Univ, MINES ParisTech, CBIO Ctr Computat Biol, F-75006 Paris, France
[3] Inst Curie, F-75248 Paris 5, France
[4] INSERM, U900, F-75248 Paris 5, France
[5] Google Res, Brain Team, F-75009 Paris, France
基金
欧洲研究理事会;
关键词
SOMATIC MUTATIONS; PROLIFERATION; EXPRESSION; HALLMARKS; ONCOGENE; PATHWAYS; MEIS1; KRAS;
D O I
10.1371/journal.pcbi.1007381
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Cancer driver genes, i.e., oncogenes and tumor suppressor genes, are involved in the acquisition of important functions in tumors, providing a selective growth advantage, allowing uncontrolled proliferation and avoiding apoptosis. It is therefore important to identify these driver genes, both for the fundamental understanding of cancer and to help finding new therapeutic targets or biomarkers. Although the most frequently mutated driver genes have been identified, it is believed that many more remain to be discovered, particularly for driver genes specific to some cancer types. In this paper, we propose a new computational method called LOTUS to predict new driver genes. LOTUS is a machine-learning based approach which allows to integrate various types of data in a versatile manner, including information about gene mutations and protein-protein interactions. In addition, LOTUS can predict cancer driver genes in a pan-cancer setting as well as for specific cancer types, using a multitask learning strategy to share information across cancer types. We empirically show that LOTUS outperforms five other state-of-the-art driver gene prediction methods, both in terms of intrinsic consistency and prediction accuracy, and provide predictions of new cancer genes across many cancer types.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] IDriveGenes: Cancer Driver Genes Prediction Using Machine Learning
    Ali, Yasir
    Sardaraz, Muhammad
    Tahir, Muhammad
    Elmannai, Hela
    Hamdi, Monia
    Ksibi, Amel
    [J]. IEEE Access, 2023, 11 : 28439 - 28453
  • [2] IDriveGenes: Cancer Driver Genes Prediction Using Machine Learning
    Ali, Yasir
    Sardaraz, Muhammad
    Tahir, Muhammad
    Elmannai, Hela
    Hamdi, Monia
    Ksibi, Amel
    [J]. IEEE ACCESS, 2023, 11 : 28439 - 28453
  • [3] Machine learning methods for prediction of cancer driver genes: a survey paper
    Andrades, Renan
    Recamonde-Mendoza, Mariana
    [J]. BRIEFINGS IN BIOINFORMATICS, 2022, 23 (03)
  • [4] DriverML: a machine learning algorithm for identifying driver genes in cancer sequencing studies
    Han, Yi
    Yang, Juze
    Qian, Xinyi
    Cheng, Wei-Chung
    Liu, Shu-Hsuan
    Hua, Xing
    Zhou, Liyuan
    Yang, Yaning
    Wu, Qingbiao
    Liu, Pengyuan
    Lu, Yan
    [J]. NUCLEIC ACIDS RESEARCH, 2019, 47 (08)
  • [5] EARN: an ensemble machine learning algorithm to predict driver genes in metastatic breast cancer
    Leila Mirsadeghi
    Reza Haji Hosseini
    Ali Mohammad Banaei-Moghaddam
    Kaveh Kavousi
    [J]. BMC Medical Genomics, 14
  • [6] EARN: an ensemble machine learning algorithm to predict driver genes in metastatic breast cancer
    Mirsadeghi, Leila
    Haji Hosseini, Reza
    Banaei-Moghaddam, Ali Mohammad
    Kavousi, Kaveh
    [J]. BMC MEDICAL GENOMICS, 2021, 14 (01)
  • [7] Machine learning approaches for prediction of ovarian cancer driver genes from mutational and network analysis
    Wadapurkar, Rucha
    Bapat, Sanket
    Mahajan, Rupali
    Vyas, Renu
    [J]. DATA TECHNOLOGIES AND APPLICATIONS, 2024, 58 (01) : 62 - 80
  • [8] Evaluating machine learning methodologies for identification of cancer driver genes
    Sharaf J. Malebary
    Yaser Daanial Khan
    [J]. Scientific Reports, 11
  • [9] Evaluating machine learning methodologies for identification of cancer driver genes
    Malebary, Sharaf J.
    Khan, Yaser Daanial
    [J]. SCIENTIFIC REPORTS, 2021, 11 (01)
  • [10] Prediction of driver variants in the cancer genome via machine learning methodologies
    Rogers, Mark F.
    Gaunt, Tom R.
    Campbell, Colin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)