Peptide-based functional annotation of carbohydrate-active enzymes by conserved unique peptide patterns (CUPP)

Cited by: 45
Authors
Barrett, Kristian [1]
Lange, Lene [2]
Affiliations
[1] Tech Univ Denmark, Dept Biotechnol & Biomed, Lyngby, Denmark
[2] BioEcon Res & Advisory, Valby, Denmark
Keywords
Peptide pattern recognition; Automated protein clustering; Protein group creation; Automated functional protein annotation; Systemized genome enzyme discovery; Multiple sequence alignment; Hydrolase family 30; Database; Subfamilies
DOI
10.1186/s13068-019-1436-5
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Discipline classification codes
071005; 0836; 090102; 100705
Abstract
Background: Insight into the function of carbohydrate-active enzymes is required to understand their biological role and industrial potential. Better use of the ample genomic data is needed to enable selection of the most interesting proteins for further study. The new approach to sequence analysis rests on the hypothesis that, when conserved peptide patterns are used to determine similarities between proteins, the exact spacing between conserved adjacent amino acids plays a prominent functional role. The objective of developing the conserved unique peptide patterns (CUPP) method is therefore to construct a peptide-based grouping and to validate it, providing evidence that CUPP captures function-related features of individual carbohydrate-active enzymes (as defined by CAZy families). This approach facilitates grouping of enzymes at a level below protein families and/or subfamilies. A standardized, efficient, and robust approach to functional annotation of carbohydrate-active enzymes would support improved molecular insight into enzyme-substrate interaction.
Results: A new non-alignment-based clustering and functional annotation tool was developed that uses conserved unique peptide patterns to cluster proteins automatically and form protein groups. A peptide-based model was constructed for each of these CUPP groups and used to automatically annotate the protein family, subfamily, and EC function of carbohydrate-active enzymes. CUPP prediction can annotate proteins (from any CAZy family) with a high F-score for existing family (0.966), subfamily (0.961), and EC function (0.843). The speed of the CUPP program was estimated and exemplified by prediction of the 504,017 nonredundant proteins of CAZy in less than four CPU hours.
Conclusion: It was possible to construct an automated system for clustering proteins within families and to use the resulting CUPP groups to directly build peptide-based models for genome annotation. The CUPP runtime, F-score, sensitivity, and precision of family and subfamily annotations match or improve on state-of-the-art tools. The speed of CUPP annotation is similar to that of the rapid DIAMOND annotation tool. CUPP facilitates automated annotation of full genome assemblies to any CAZy family.
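The evaluation figures in the abstract can be made concrete with a short sketch. It assumes the reported F-score is the usual balanced F1 (the harmonic mean of precision and sensitivity), which the abstract does not state explicitly. The function names `f_score` and `min_throughput` and the example counts are hypothetical and are not taken from the CUPP software or the paper.

```python
def f_score(tp: int, fp: int, fn: int, beta: float = 1.0) -> float:
    """F-beta score from raw counts (assumed definition; beta=1 gives F1).

    tp: correct annotations, fp: wrong annotations,
    fn: proteins that should have been annotated but were missed.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and sensitivity == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * sensitivity / (b2 * precision + sensitivity)


def min_throughput(n_proteins: int, cpu_hours: float) -> float:
    """Lower bound on proteins annotated per CPU second for a given time budget."""
    return n_proteins / (cpu_hours * 3600)


if __name__ == "__main__":
    # Hypothetical counts, chosen only to illustrate the formula.
    print(f"F1 = {f_score(tp=90, fp=10, fn=10):.3f}")
    # 504,017 proteins in under four CPU hours (figures from the abstract)
    # implies at least about 35 proteins per CPU second.
    print(f"throughput >= {min_throughput(504_017, 4):.1f} proteins per CPU second")
```

Because the abstract reports "less than four CPU hours", the throughput value is a lower bound, not a measured rate.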
Pages: 21