Enhancing gene regulatory networks inference through hub-based data integration

Cited by: 0
Authors
Naseri, Atefeh [1]
Sharghi, Mehran [1, 2]
Hasheminejad, Seyed Mohammad Hossein [1]
Affiliations
[1] Alzahra Univ, Dept Comp Engn, Tehran, Iran
[2] Heriot Watt Univ, Edinburgh EH14 4AS, Midlothian, Scotland
Keywords
Data integration; Diffusion algorithm; Esophageal cancer; Gene regulatory network; Gene regulatory network inference; Random walk with restart; EXPRESSION; FUSION; BIOLOGY; CANCER;
DOI
10.1016/j.compbiolchem.2021.107589
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07; 0710; 09;
Abstract
One of the main research topics in computational biology is Gene Regulatory Network (GRN) reconstruction, which refers to inferring the relationships between genes involved in regulating cell conditions in response to internal or external stimuli. Most computational methods use only transcriptional gene expression data to reconstruct GRNs, but recent studies suggest that gene expression data must be integrated with other types of data to obtain more accurate models of the real relationships between genes. In this study, a diffusion-based method is enhanced to integrate network-type biological data together with structural prior knowledge. The Random Walk with Restart (RWR) algorithm, with an emphasis on hub nodes, is executed separately on each network, and low-dimensional feature vectors for the network nodes are then jointly optimized by diffusion component analysis. Next, these feature vectors are used to infer gene regulatory networks. Fourteen centrality measures are studied for the detection of hub nodes to be used in the RWR algorithm, and the centrality measure with the greatest effect on the improvement of gene network inference is selected. A case study on the Saccharomyces cerevisiae and E. coli networks shows that using the proposed features instead of gene expression data alone improves the Area Under the Receiver Operating Characteristic curve (AUROC) by 0.02 to 0.08 across different GRN inference methods. Furthermore, the proposed method was applied to esophageal cancer data to infer its gene regulatory network. The proposed framework substantially improves the accuracy and scalability of GRN inference. The fused features and the best centrality measure detected can be used to provide functional insights about genes or proteins in various biological applications. Moreover, it can serve as a general framework for integrating and analyzing network data and structural data in various scientific disciplines, including biology.
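The Random Walk with Restart step named in the abstract can be illustrated with a minimal Python sketch. The abstract does not say how hub nodes are emphasized, so the weighting below is purely an assumption: edges are biased toward destinations with high degree centrality through a hypothetical helper, `degree_hub_weights`. The function names, the `restart` and `strength` parameters, and the toy network are illustrative and are not taken from the paper.

```python
import numpy as np


def degree_hub_weights(adj, strength=1.0):
    """Hypothetical hub weights: 1 + strength * (degree / max degree)."""
    deg = np.asarray(adj, dtype=float).sum(axis=1)
    max_deg = deg.max()
    if max_deg == 0:
        return np.ones_like(deg)
    return 1.0 + strength * deg / max_deg


def rwr_diffusion_states(adj, restart=0.5, hub_weight=None,
                         tol=1e-10, max_iter=1000):
    """Return a matrix whose row i is the RWR distribution started at node i."""
    adj = np.asarray(adj, dtype=float)
    n = adj.shape[0]
    if hub_weight is not None:
        # Assumption: emphasize hubs by up-weighting edges that lead into them.
        adj = adj * np.asarray(hub_weight, dtype=float)[np.newaxis, :]
    row_sums = adj.sum(axis=1, keepdims=True)
    # Row-normalize into transition probabilities; rows of isolated nodes stay zero.
    trans = np.divide(adj, row_sums, out=np.zeros_like(adj), where=row_sums > 0)
    # An isolated node keeps the walker in place so each row still sums to 1.
    isolated = np.flatnonzero(row_sums.ravel() == 0)
    trans[isolated, isolated] = 1.0
    identity = np.eye(n)
    states = identity.copy()
    for _ in range(max_iter):
        # One step: follow an edge with prob. (1 - restart), else jump back to the start.
        new_states = (1.0 - restart) * states @ trans + restart * identity
        converged = np.abs(new_states - states).max() < tol
        states = new_states
        if converged:
            break
    return states


# Toy undirected network: node 0 is a hub linked to 1, 2 and 3; node 3 also links to 4.
toy_adj = np.array([
    [0, 1, 1, 1, 0],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 0],
    [1, 0, 0, 0, 1],
    [0, 0, 0, 1, 0],
])
plain_states = rwr_diffusion_states(toy_adj)
hub_states = rwr_diffusion_states(toy_adj, hub_weight=degree_hub_weights(toy_adj))
```

In this sketch, each row of the returned matrix is a diffusion state for one gene. In the pipeline the abstract describes, such states from several networks would then be compressed into shared low-dimensional feature vectors by diffusion component analysis, which is not shown here.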
Pages: 11