Proteome-wide functional classification and identification of prokaryotic transmembrane proteins by transmembrane topology similarity comparison

被引:10
|
作者
Arai, M
Okumura, K
Satake, M
Shimizu, T
机构
[1] Hirosaki Univ, Fac Sci & Technol, Dept Elect & Informat Syst Engn, Hirosaki, Aomori 0368561, Japan
[2] Tohoku Univ, Grad Sch Life Sci, Dept Dev Biol & Neurosci, Sendai, Miyagi 9808577, Japan
[3] Tohoku Univ, Inst Dev Aging & Canc, Dept Mol Immunol, Sendai, Miyagi 9808577, Japan
关键词
transmembrane protein; transmembrane topology similarity; functional classification and identification; proteome-wide analysis; prokaryotic genome;
D O I
10.1110/ps.04814404
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We propose a new method for classifying and identifying transmembrane (TM) protein functions in proteome-scale by applying a single-linkage clustering method based on TM topology similarity, which is calculated simply from comparing the lengths of loop regions. In this study, we focused on 87 prokaryotic TM proteomes consisting of 31 proteobacteria, 22 gram-positive bacteria, 19 other bacteria, and 15 archaea. Prior to performing the clustering, we first categorized individual TM protein sequences as "known," "putative" (similar to "known" sequences), or "unknown" by using the homology search and the sequence similarity comparison against SWISS-PROT to assess the current status of the functional annotation of the TM proteomes based on sequence similarity only. More than three-quarters, that is, 75.7% of the TM protein sequences are functionally "unknown," with only 3.8% and 20.5% of them being classified as "known" and "putative," respectively. Using our clustering approach based on TM topology similarity, we succeeded in increasing the rate of TM protein sequences functionally classified and identified from 24.3% to 60.9%. Obtained clusters correspond well to functional superfamilies or families, and the functional classification and identification are successfully achieved by this approach. For example, in an obtained cluster of TM proteins with six TM segments, 109 sequences out of 119 sequences annotated as "ATP-binding cassette transporter" are properly included and 122 "unknown" sequences are also contained.
引用
收藏
页码:2170 / 2183
页数:14
相关论文
共 50 条
  • [1] A proteome-wide analysis of domain architectures of prokaryotic single-spanning transmembrane proteins
    Arai, M
    Fukushi, T
    Satake, M
    Shimizu, T
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2005, 29 (05) : 379 - 387
  • [2] Proteome-Wide Modeling of Transmembrane Alpha-Helical Homodimers by TMDOCK
    Lomize, Andrei L.
    Pogozheva, Irina D.
    BIOPHYSICAL JOURNAL, 2017, 112 (03) : 358A - 358A
  • [3] Proteome-wide identification and functional analysis of ubiquitinated proteins in peach leaves
    Song, Yanbo
    Shi, Xiaojing
    Zou, Yanli
    Guo, Juanru
    Huo, Nan
    Chen, Shuangjian
    Zhao, Chengping
    Li, Hong
    Wu, Guoliang
    Peng, Yong
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [4] Proteome-wide classification and identification of mammalian-type GPCRs by binary topology pattern
    Inoue, Y
    Ikeda, M
    Shimizu, J
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2004, 28 (01) : 39 - 49
  • [5] Proteome-wide identification of palmitoylated proteins in mouse testis
    Gao, Jun
    Li, Wenchao
    Zhang, Zhongjian
    Gao, Wenshan
    Kong, Eryan
    REPRODUCTIVE SCIENCES, 2022, 29 (08) : 2299 - 2309
  • [6] Proteome-wide identification of palmitoylated proteins in mouse testis
    Jun Gao
    Wenchao Li
    Zhongjian Zhang
    Wenshan Gao
    Eryan Kong
    Reproductive Sciences, 2022, 29 : 2299 - 2309
  • [7] Author Correction: Proteome-wide identification and functional analysis of ubiquitinated proteins in peach leaves
    Yanbo Song
    Xiaojing Shi
    Yanli Zou
    Juanru Guo
    Nan Huo
    Shuangjian Chen
    Chengping Zhao
    Hong Li
    Guoliang Wu
    Yong Peng
    Scientific Reports, 10
  • [8] Chemical methods for the proteome-wide identification of posttranslationally modified proteins
    Chuh, Kelly N.
    Pratt, Matthew R.
    CURRENT OPINION IN CHEMICAL BIOLOGY, 2015, 24 : 27 - 37
  • [9] Transmembrane proteins in the Protein Data Bank:: identification and classification
    Tusnády, GE
    Dosztányi, Z
    Simon, I
    BIOINFORMATICS, 2004, 20 (17) : 2964 - 2972
  • [10] Proteome-Wide Identification of Lysine Succinylation in the Proteins of Tomato (Solanum lycopersicum)
    Jin, Weibo
    Wu, Fangli
    PLOS ONE, 2016, 11 (02):