Identifying functions of protein complexes based on topology similarity with random forest

被引:4
|
作者
Li, Zhan-Chao [1 ]
Lai, Yan-Hua [2 ]
Chen, Li-Li [2 ]
Xie, Yun [1 ]
Dai, Zong [2 ]
Zou, Xiao-Yong [2 ]
机构
[1] Guangdong Pharmaceut Univ, Sch Chem & Chem Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Chem & Chem Engn, Guangzhou 510275, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
INTERACTION NETWORKS; COMMUNITY STRUCTURE; PREDICTION; CLASSIFICATION; ALGORITHM; DISCOVERY; RESOURCE;
D O I
10.1039/c3mb70401g
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Elucidating the functions of protein complexes is critical for understanding disease mechanisms, diagnosis and therapy. In this study, based on the concept that protein complexes with similar topology may have similar functions, we firstly model protein complexes as weighted graphs with nodes representing the proteins and edges indicating interaction between proteins. Secondly, we use topology features derived from the graphs to characterize protein complexes based on the graph theory. Finally, we construct a predictor by using random forest and topology features to identify the functions of protein complexes. Effectiveness of the current method is evaluated by identifying the functions of mammalian protein complexes. And then the predictor is also utilized to identify the functions of protein complexes retrieved from human protein-protein interaction networks. We identify some protein complexes with significant roles in the occurrence of tumors, vesicles and retinoblastoma. It is anticipated that the current research has an important impact on pathogenesis and the pharmaceutical industry. The source code of Matlab and the dataset are freely available on request from the authors.
引用
收藏
页码:514 / 525
页数:12
相关论文
共 50 条
  • [21] Framework to Identify Protein Complexes Based on Similarity Preclustering
    Peng, Xiaoqing
    Yan, Xiaodong
    Wang, Jianxin
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2017, 22 (01) : 42 - 51
  • [22] Topology of random simplicial complexes: a survey
    Kahle, Matthew
    [J]. ALGEBRAIC TOPOLOGY: APPLICATIONS AND NEW DIRECTIONS, 2014, 620 : 201 - 221
  • [23] Topology of Random 2-Complexes
    Cohen, D.
    Costa, A.
    Farber, M.
    Kappeler, T.
    [J]. DISCRETE & COMPUTATIONAL GEOMETRY, 2012, 47 (01) : 117 - 149
  • [24] Topology of random geometric complexes: a survey
    Bobrowski O.
    Kahle M.
    [J]. Journal of Applied and Computational Topology, 2018, 1 (3-4) : 331 - 364
  • [25] ANATOMICAL STRUCTURE SIMILARITY ESTIMATION BY RANDOM FOREST
    Pei, Yuru
    Kou, Lei
    Zha, Hongbin
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 2941 - 2945
  • [26] Random forest based similarity learning for single cell RNA sequencing data
    Pouyan, Maziyar Baran
    Kostka, Dennis
    [J]. BIOINFORMATICS, 2018, 34 (13) : 79 - 88
  • [27] WiFi-Based Indoor Positioning by Random Forest and Adjusted Cosine Similarity
    Gao, Jieyu
    Li, Xiaohui
    Ding, Yuemin
    Su, Qian
    Liu, Zhenxing
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 1426 - 1431
  • [28] Identifying feature relevance using a random forest
    Rogers, Jeremy
    Gunn, Steve
    [J]. SUBSPACE, LATENT STRUCTURE AND FEATURE SELECTION, 2006, 3940 : 173 - 184
  • [29] Reconstructing the topology of protein complexes
    Bernard, Allister
    Vaughn, David S.
    Hartemink, Alexander J.
    [J]. RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2007, 4453 : 32 - +
  • [30] Pippin: A random forest-based method for identifying presynaptic and postsynaptic neurotoxins
    Li, Pengyu
    Zhang, He
    Zhao, Xuyang
    Jia, Cangzhi
    Li, Fuyi
    Song, Jiangning
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2020, 18 (02)