Adapting Community Detection Algorithms for Disease Module Identification in Heterogeneous Biological Networks

被引:38
|
作者
Tripathi, Beethika [1 ,2 ,3 ]
Parthasarathy, Srinivasan [4 ,5 ]
Sinha, Himanshu [2 ,3 ,6 ]
Raman, Karthik [2 ,3 ,6 ]
Ravindran, Balaraman [1 ,2 ,3 ]
机构
[1] Indian Inst Technol Madras, Dept Comp Sci & Engn, Chennai, Tamil Nadu, India
[2] Indian Inst Technol Madras, Initiat Biol Syst Engn, Chennai, Tamil Nadu, India
[3] Indian Inst Technol Madras, Robert Bosch Ctr Data Sci & AI, Chennai, Tamil Nadu, India
[4] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[5] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[6] Indian Inst Technol Madras, Bhupat & Jyoti Mehta Sch Biosci, Dept Biotechnol, Chennai, Tamil Nadu, India
关键词
overlapping community detection; non-overlapping community detection; disease module identification; biological networks; heterogeneous networks; PROTEIN; PATHWAYS; MEDICINE;
D O I
10.3389/fgene.2019.00164
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Biological networks catalog the complex web of interactions happening between different molecules, typically proteins, within a cell. These networks are known to be highly modular, with groups of proteins associated with specific biological functions. Human diseases often arise from the dysfunction of one or more such proteins of the biological functional group. The ability, to identify and automatically extract these modules has implications for understanding the etiology of different diseases as well as the functional roles of different protein modules in disease. The recent DREAM challenge posed the problem of identifying disease modules from six heterogeneous networks of proteins/genes. There exist many community detection algorithms, but all of them are not adaptable to the biological context, as these networks are densely connected and the size of biologically relevant modules is quite small. The contribution of this study is 3-fold: first, we present a comprehensive assessment of many classic community detection algorithms for biological networks to identify non-overlapping communities, and propose heuristics to identify small and structurally well-defined communities-core modules. We evaluated our performance over 180 GWAS datasets. In comparison to traditional approaches, with our proposed approach we could identify 50% more number of disease-relevant modules. Thus, we show that it is important to identify more compact modules for better performance. Next, we sought to understand the peculiar characteristics of disease-enriched modules and what causes standard community detection algorithms to detect so few of them. We performed a comprehensive analysis of the interaction patterns of known disease genes to understand the structure of disease modules and show that merely considering the known disease genes set as a module does not give good quality clusters, as measured by typical metrics such as modularity and conductance. We go on to present a methodology leveraging these known disease genes, to also include the neighboring nodes of these genes into a module, to form good quality clusters and subsequently extract a "gold-standard set" of disease modules. Lastly, we demonstrate, with justification, that "overlapping" community detection algorithms should be the preferred choice for disease module identification since several genes participate in multiple biological functions.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Differential Community Detection in Paired Biological Networks
    Mall, Raghvendra
    Ullah, Ehsan
    Kunji, Khalid
    D'Angelo, Fulvio
    Bensmail, Halima
    Ceccarelli, Michele
    ACM-BCB' 2017: PROCEEDINGS OF THE 8TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY,AND HEALTH INFORMATICS, 2017, : 330 - 339
  • [32] Genetic Algorithm for Community Detection in Biological Networks
    Ben M'Barek, Marwa
    Borgi, Amel
    Bedhiafi, Walid
    Ben Hmida, Sana
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 195 - 204
  • [33] Bacterial Biological Mechanisms for Functional Module Detection in PPI Networks
    Yang, Cuicui
    Ji, Junzhong
    Zhang, Aidong
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 318 - 323
  • [34] Random matrix improved community detection in heterogeneous networks
    Ali, Hafiz Tiomoko
    Couillet, Romain
    2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 1385 - 1389
  • [35] Improved spectral community detection in large heterogeneous networks
    Tiomoko Ali, Hafiz
    Couillet, Romain
    JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18
  • [36] SPECTRAL ALGORITHM FOR COMMUNITY DETECTION UNDER HETEROGENEOUS NETWORKS
    Zhou, Qiang
    Cai, Shi-Min
    Zhang, Yi-Cheng
    2019 16TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICWAMTIP), 2019, : 76 - 80
  • [37] Research Review on Algorithms of Community Detection in Complex Networks
    Zhang, Jingyi
    Ma, Zhixin
    Sun, Qijuan
    Yan, Jun
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2018), 2018, 1069
  • [38] An Analysis of Overlapping Community Detection Algorithms in Social Networks
    Devi, J. Chitra
    Poovammal, E.
    TWELFTH INTERNATIONAL CONFERENCE ON COMMUNICATION NETWORKS, ICCN 2016 / TWELFTH INTERNATIONAL CONFERENCE ON DATA MINING AND WAREHOUSING, ICDMW 2016 / TWELFTH INTERNATIONAL CONFERENCE ON IMAGE AND SIGNAL PROCESSING, ICISP 2016, 2016, 89 : 349 - 358
  • [39] An Analysis of the Dynamic Community Detection Algorithms in Complex Networks
    Singh, Dhananjay Kumar
    Debnath, Narayan C.
    Haraty, Ramzi A.
    Choudhury, Prasenjit
    2020 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2020, : 989 - 994
  • [40] A Scheme to Design Community Detection Algorithms in Various Networks
    Lu, Haoye
    Nayak, Amiya
    FUTURE INTERNET, 2019, 11 (02):