Identifying Candidate Disease Genes in Multilayer Heterogeneous Biological Networks

被引:0
|
作者
Ding C.-F. [1 ]
Wang J. [1 ]
Zhang Z.-Y. [1 ]
机构
[1] College of Mathematics and Computer Science, Yan'an University, Yan'an
来源
基金
中国国家自然科学基金;
关键词
biased random walk; biological network; candidate gene identification; Multilayer heterogeneous network;
D O I
10.16383/j.aas.c210577
中图分类号
学科分类号
摘要
Most of existing random walk methods to identify candidate disease genes preferentially visit highly-connected genes, while unwell-known or poorly-connected genes probably relevant to known diseases are more easily ignored or complicated to identify. Moreover, these methods access only a single gene network or an aggregated network of various gene data, leading to bias and incompleteness. Therefore, it is a pressing challenge for controlling the motion direction of random walk and for integrating multiple data sources involving different information for disease-gene identification. To this end, we first construct a multilayer network and multilayer heterogeneous genetic network. Then, we propose a topologically biased random walk with restart (BRWR) algorithm applicable to multilayer and multilayer heterogeneous networks for the identification of candidate disease genes. Experimental results show that the BRWR algorithm to identify candidate disease genes outperforms the state-of-the-art ones on different types of networks. Finally, the BRWR algorithm on multilayer heterogeneous networks is used to predict disease genes implicated in the undiagnosed neonatal progeroid syndrome. © 2024 Science Press. All rights reserved.
引用
收藏
页码:1246 / 1260
页数:14
相关论文
共 59 条
  • [1] Guala D, Sonnhammer E L L., A large-scale benchmark of gene prioritization methods, Scientific Reports, 7, (2017)
  • [2] Schwikowski B, Uetz P, Fields S., A network of protein-protein interactions in yeast, Nature Biotechnology, 18, 12, pp. 1257-1261, (2000)
  • [3] Sharma V, Ranjan T, Kumar P, Pal A K, Jha V K, Sahni S, Et al., Protein-protein interaction detection: Methods and analysis, Plant Biotechnology, pp. 391-411, (2018)
  • [4] Chen Y, Jiang T, Jiang R., Uncover disease genes by maximizing information flow in the phenome-interactome network, Bioinformatics, 27, 13, pp. i167-i176, (2011)
  • [5] Zhang Y G, Wang Y, Liu J H, Liu X H, Hong Y X, Fan X, Et al., IDLP: A novel label propagation framework for disease gene prioritization, Proceedings of the 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAK-DD), pp. 261-272, (2018)
  • [6] Chen Y, Wu X B, Jiang R., Integrating human omics data to prioritize candidate genes, BMC Medical Genomics, 6, (2013)
  • [7] Lee J H, Zhao X M, Yoon I, Lee J Y, Kwon N H, Wang Y Y, Et al., Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers, Cell Discovery, 2, (2016)
  • [8] Yang K, Zhao X Z, Waxman D, Zhao X M., Predicting drug-disease associations with heterogeneous network embedding, Chaos, 29, 12, (2019)
  • [9] Yang A Y, Chen J Q, Zhao X M., nMAGMA: A network-enhanced method for inferring risk genes from GWAS summary statistics and its application to schizophrenia, Briefings in Bioinformatics, 22, 4, (2021)
  • [10] Kohler S, Bauer S, Horn D, Robinson P N., Walking the interactome for prioritization of candidate disease genes, The American Journal of Human Genetics, 82, 4, pp. 949-958, (2008)