A random walk-based method to identify driver genes by integrating the subcellular localization and variation frequency into bipartite graph

被引:25
|
作者
Song, Junrong [1 ]
Peng, Wei [1 ]
Wang, Feng [1 ]
机构
[1] Kunming Univ Sci & Technol, Fac Management & Econ, Comp Ctr, Fac Informat Engn & Automat,Technol Applicat Key, Lianhua Rd, Kunming 650050, Yunnan, Peoples R China
基金
中国国家自然科学基金;
关键词
Driver genes; Random walk; Subcellular localization; Variation frequency; Dysregulated genes; Genomic expression; CANCER GENES; NETWORK; MUTATIONS; DISCOVERY; PATHWAYS;
D O I
10.1186/s12859-019-2847-9
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundCancer as a worldwide problem is driven by genomic alterations. With the advent of high-throughput sequencing technology, a huge amount of genomic data generates at every second which offer many valuable cancer information and meanwhile throw a big challenge to those investigators. As the major characteristic of cancer is heterogeneity and most of alterations are supposed to be useless passenger mutations that make no contribution to the cancer progress. Hence, how to dig out driver genes that have effect on a selective growth advantage in tumor cells from those tremendously and noisily data is still an urgent task.ResultsConsidering previous network-based method ignoring some important biological properties of driver genes and the low reliability of gene interactive network, we proposed a random walk method named as Subdyquency that integrates the information of subcellular localization, variation frequency and its interaction with other dysregulated genes to improve the prediction accuracy of driver genes. We applied our model to three different cancers: lung, prostate and breast cancer. The results show our model can not only identify the well-known important driver genes but also prioritize the rare unknown driver genes. Besides, compared with other existing methods, our method can improve the precision, recall and fscore to a higher level for most of cancer types.ConclusionsThe final results imply that driver genes are those prone to have higher variation frequency and impact more dysregulated genes in the common significant compartment.AvailabilityThe source code can be obtained at https://github.com/weiba/Subdyquency.
引用
收藏
页数:17
相关论文
共 9 条
  • [1] A random walk-based method to identify driver genes by integrating the subcellular localization and variation frequency into bipartite graph
    Junrong Song
    Wei Peng
    Feng Wang
    [J]. BMC Bioinformatics, 20
  • [2] A Random Walk-Based Method to Identify Candidate Genes Associated With Lymphoma
    Sheng, Minjie
    Cai, Haiying
    Yang, Qin
    Li, Jing
    Zhang, Jian
    Liu, Lihua
    [J]. FRONTIERS IN GENETICS, 2021, 12
  • [3] A Review of Random Walk-Based Method for the Identification of Disease Genes and Disease Modules
    Hui, Tay Xin
    Kasim, Shahreen
    Fudzee, Mohd Farhan Md.
    Sutikno, Tole
    Hassan, Rohayanti
    Aziz, Izzatdin Abdul
    Hasan, Mohd Hilmi
    Jaafar, Jafreezal
    Alharbi, Metab
    Sen, Seah Choon
    [J]. IEEE ACCESS, 2023, 11 : 116366 - 116383
  • [4] A random walk-based method for detecting essential proteins by integrating the topological and biological features of PPI network
    Ahmed, Nahla Mohamed
    Chen, Ling
    Li, Bin
    Liu, Wei
    Dai, Caiyan
    [J]. SOFT COMPUTING, 2021, 25 (14) : 8883 - 8903
  • [5] A random walk-based method for detecting essential proteins by integrating the topological and biological features of PPI network
    Nahla Mohamed Ahmed
    Ling Chen
    Bin Li
    Wei Liu
    Caiyan Dai
    [J]. Soft Computing, 2021, 25 : 8883 - 8903
  • [6] A weight optimization method for chemical safety evaluation indicators based on the bipartite graph and random walk
    Du, Junwei
    Jing, Guanghui
    Hu, Qiang
    [J]. JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2022, 9 (04) : 1214 - 1217
  • [7] Identifying Pathological Myopia Associated Genes with A Random Walk-Based Method in Protein-Protein Interaction Network
    Zhang, Jiyu
    Huang, Tao
    Sun, Qiao
    Zhang, Jian
    [J]. CURRENT BIOINFORMATICS, 2024, 19 (04) : 375 - 384
  • [8] Random walk based method to identify essential proteins by integrating network topology and biological characteristics
    Lei, Xiujuan
    Yang, Xiaoqin
    Fujita, Hamido
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 167 : 53 - 67
  • [9] A network-based method using a random walk with restart algorithm and screening tests to identify novel genes associated with Meniere's disease
    Li, Lin
    Wang, YanShu
    An, Lifeng
    Kong, XiangYin
    Huang, Tao
    [J]. PLOS ONE, 2017, 12 (08):