Peptide-based functional annotation of carbohydrate-active enzymes by conserved unique peptide patterns (CUPP)

被引:45
|
作者
Barrett, Kristian [1 ]
Lange, Lene [2 ]
机构
[1] Tech Univ Denmark, Dept Biotechnol & Biomed, Lyngby, Denmark
[2] BioEcon Res & Advisory, Valby, Denmark
关键词
Peptide pattern recognition; Automated protein clustering; Protein group creation; Automated functional protein annotation; Systemized genome enzyme discovery; MULTIPLE SEQUENCE ALIGNMENT; HYDROLASE FAMILY 30; DATABASE; SUBFAMILIES;
D O I
10.1186/s13068-019-1436-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundInsight into the function of carbohydrate-active enzymes is required to understand their biological role and industrial potential. There is a need for better use of the ample genomic data in order to enable selection of the most interesting proteins for further studies. The basis for elaborating a new approach to sequence analysis is the hypothesis that when using conserved peptide patterns to determine the similarities between proteins, the exact spacing between conserved adjacent amino acids in the proteins plays a prominent functional role. Thus, the objective of developing the method of conserved unique peptide patterns (CUPP) is to construct a peptide-based grouping and validate the method to provide evidence that CUPP captures function-related features of the individual carbohydrate-active enzymes (as defined by CAZy families). This approach facilitates grouping of enzymes at a level lower than protein families and/or subfamilies. A standardized, efficient, and robust approach to functional annotation of carbohydrate-active enzymes would support improved molecular insight into enzyme-substrate interaction.ResultsA new nonalignment-based clustering and functional annotation tool was developed that uses conserved unique peptides patterns to perform automated clustering of proteins and formation of protein groups. A peptide-based model was constructed for each of these protein CUPP groups to be used to automatically annotate protein family, subfamily, and EC function of carbohydrate-active enzymes. CUPP prediction can annotate proteins (from any CAZy family) with high F-score to existing family (0.966), subfamily (0.961), and EC-function (0.843). The speed of the CUPP program was estimated and exemplified by prediction of the 504,017 nonredundant proteins of CAZy in less than four CPU hours.ConclusionIt was possible to construct an automated system for clustering proteins within families and use the resulting CUPP groups to directly build peptide-based models for genome annotation. The CUPP runtime, F-score, sensitivity, and precisions of family and subfamily annotations match or represent an improvement compared to state-of-the-art tools. The speed of the CUPP annotation is similar to the rapid DIAMOND annotation tool. CUPP facilitates automated annotation of full genome assemblies to any CAZy family.
引用
收藏
页数:21
相关论文
共 50 条
  • [31] Functional Annotation of Fibrobacter succinogenes S85 Carbohydrate Active Enzymes
    Brumm, Phillip
    Mead, David
    Boyum, Julie
    Drinkwater, Colleen
    Gowda, Krishne
    Stevenson, David
    Weimer, Paul
    APPLIED BIOCHEMISTRY AND BIOTECHNOLOGY, 2011, 163 (05) : 649 - 657
  • [32] Functional Annotation of Fibrobacter succinogenes S85 Carbohydrate Active Enzymes
    Phillip Brumm
    David Mead
    Julie Boyum
    Colleen Drinkwater
    Krishne Gowda
    David Stevenson
    Paul Weimer
    Applied Biochemistry and Biotechnology, 2011, 163 : 649 - 657
  • [33] Modern Lipid-, Carbohydrate-, and Peptide-Based Delivery Systems for Peptide, Vaccine, and Gene Products
    Simerska, Pavla
    Moyle, Peter M.
    Toth, Istvan
    MEDICINAL RESEARCH REVIEWS, 2011, 31 (04) : 520 - 547
  • [34] A cyclic peptide-based redox-active model of rubredoxin
    Jacques, A.
    Blondin, G.
    Seneque, O.
    Latour, J.
    Jacques, A.
    Clemancey, M.
    Fourmond, V.
    JOURNAL OF BIOLOGICAL INORGANIC CHEMISTRY, 2014, 19 : S471 - S471
  • [35] A cyclic peptide-based redox-active model of rubredoxin
    Jacques, Aurelie
    Clemancey, Martin
    Blondin, Genevieve
    Fourmond, Vincent
    Latour, Jean-Marc
    Seneque, Olivier
    CHEMICAL COMMUNICATIONS, 2013, 49 (28) : 2915 - 2917
  • [36] RENIN INHIBITORS - A PARADIGM FOR PEPTIDE-BASED ORALLY ACTIVE COMPOUNDS
    KLEINERT, HD
    BAKER, WR
    ROSENBERG, SH
    STEIN, HH
    JOURNAL OF CELLULAR BIOCHEMISTRY, 1993, : 210 - 210
  • [37] Peptide-Based Star Polymers: The Rising Star in Functional Polymers
    Sulistio, Adrian
    Gurr, Paul A.
    Blencowe, Anton
    Qiao, Greg G.
    AUSTRALIAN JOURNAL OF CHEMISTRY, 2012, 65 (08) : 978 - 984
  • [38] Erratum: Functional Annotation of Fibrobacter succinogenes S85 Carbohydrate Active Enzymes
    Phillip Brumm
    David Mead
    Julie Boyum
    Colleen Drinkwater
    Jan Deneke
    Krishne Gowda
    David Stevenson
    Paul Weimer
    Applied Biochemistry and Biotechnology, 2011, 163 : 692 - 692
  • [39] Peptide-Based Functional Biomaterials for Soft-Tissue Repair
    Hosoyama, Katsuhiro
    Lazurko, Caitlin
    Munoz, Marcelo
    McTiernan, Christopher D.
    Alarcon, Emilio, I
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2019, 7
  • [40] Design of Functional RGD Peptide-Based Biomaterials for Tissue Engineering
    Kumar, Vijay Bhooshan
    Tiwari, Om Shanker
    Finkelstein-Zuta, Gal
    Rencus-Lazar, Sigal
    Gazit, Ehud
    PHARMACEUTICS, 2023, 15 (02)