Constructing a gene semantic similarity network for the inference of disease genes

被引:66
|
作者
Jiang, Rui [1 ,2 ]
Gan, Mingxin [3 ]
He, Peng [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Automat, TNLIST, MOE Key Lab Bioinformat, Beijing 100084, Peoples R China
[2] Tsinghua Univ, Dept Automat, TNLIST, Bioinformat Div, Beijing 100084, Peoples R China
[3] Univ Sci & Technol Beijing, Sch Econ & Management, Beijing 100083, Peoples R China
来源
BMC SYSTEMS BIOLOGY | 2011年 / 5卷
基金
中国国家自然科学基金;
关键词
PHENOME-INTERACTOME NETWORK; CANDIDATE GENES; PRIORITIZATION; ONTOLOGY; WALKING; TRAITS; TERMS;
D O I
10.1186/1752-0509-5-S2-S2
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation: The inference of genes that are truly associated with inherited human diseases from a set of candidates resulting from genetic linkage studies has been one of the most challenging tasks in human genetics. Although several computational approaches have been proposed to prioritize candidate genes relying on protein-protein interaction (PPI) networks, these methods can usually cover less than half of known human genes. Results: We propose to rely on the biological process domain of the gene ontology to construct a gene semantic similarity network and then use the network to infer disease genes. We show that the constructed network covers about 50% more genes than a typical PPI network. By analyzing the gene semantic similarity network with the PPI network, we show that gene pairs tend to have higher semantic similarity scores if the corresponding proteins are closer to each other in the PPI network. By analyzing the gene semantic similarity network with a phenotype similarity network, we show that semantic similarity scores of genes associated with similar diseases are significantly different from those of genes selected at random, and that genes with higher semantic similarity scores tend to be associated with diseases with higher phenotype similarity scores. We further use the gene semantic similarity network with a random walk with restart model to infer disease genes. Through a series of large-scale leave-one-out cross-validation experiments, we show that the gene semantic similarity network can achieve not only higher coverage but also higher accuracy than the PPI network in the inference of disease genes.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Constructing an integrated gene similarity network for the identification of disease genes
    Zhen Tian
    Maozu Guo
    Chunyu Wang
    LinLin Xing
    Lei Wang
    Yin Zhang
    [J]. Journal of Biomedical Semantics, 8
  • [2] Constructing an integrated gene similarity network for the identification of disease genes
    Tian, Zhen
    Guo, Maozu
    Wang, Chunyu
    Xing, LinLin
    Wang, Lei
    Zhang, Yin
    [J]. JOURNAL OF BIOMEDICAL SEMANTICS, 2017, 8
  • [3] Constructing an integrated gene similarity network for the identification of disease genes
    Tian, Zhen
    Guo, Maozu
    Wang, Chunyu
    Xing, Linlin
    Wang, Lei
    Zhang, Yin
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1663 - 1668
  • [4] Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity
    Chen, Xing
    Yan, Chenggang Clarence
    Luo, Cai
    Ji, Wen
    Zhang, Yongdong
    Dai, Qionghai
    [J]. SCIENTIFIC REPORTS, 2015, 5
  • [5] Constructing lncRNA functional similarity network based on lncRNA-disease associations and disease semantic similarity
    Xing Chen
    Chenggang Clarence Yan
    Cai Luo
    Wen Ji
    Yongdong Zhang
    Qionghai Dai
    [J]. Scientific Reports, 5
  • [6] Integrating Multiple Gene Semantic Similarity Profiles to Infer Disease Genes
    He Peng
    Jiang Rui
    [J]. PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 7420 - 7425
  • [7] Prioritizing candidate disease genes combined with semantic similarity by walking on the heterogeneous network
    Xiong, Neng
    Jin, Min
    [J]. Journal of Computational and Theoretical Nanoscience, 2015, 12 (11) : 4415 - 4420
  • [8] Prioritization of candidate disease genes by combining topological similarity and semantic similarity
    Liu, Bin
    Jin, Min
    Zeng, Pan
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 57 : 1 - 5
  • [9] Inter-city association pattern recognition by constructing cultural semantic similarity network
    Wang, Haoran
    Zhang, Haiping
    Tang, Guoan
    Zhou, Lei
    Jiang, Shangjing
    [J]. TRANSACTIONS IN GIS, 2022, 26 (05) : 2225 - 2243
  • [10] Prologue Evaluation of Semantic Similarity and Textual Inference
    Fonseca, Erick
    Santos, Leandro
    Criscuolo, Marcelo
    Aluisio, Sandra
    [J]. LINGUAMATICA, 2016, 8 (02): : IX - IX