Peptide-based functional annotation of carbohydrate-active enzymes by conserved unique peptide patterns (CUPP)

被引:45
|
作者
Barrett, Kristian [1 ]
Lange, Lene [2 ]
机构
[1] Tech Univ Denmark, Dept Biotechnol & Biomed, Lyngby, Denmark
[2] BioEcon Res & Advisory, Valby, Denmark
关键词
Peptide pattern recognition; Automated protein clustering; Protein group creation; Automated functional protein annotation; Systemized genome enzyme discovery; MULTIPLE SEQUENCE ALIGNMENT; HYDROLASE FAMILY 30; DATABASE; SUBFAMILIES;
D O I
10.1186/s13068-019-1436-5
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundInsight into the function of carbohydrate-active enzymes is required to understand their biological role and industrial potential. There is a need for better use of the ample genomic data in order to enable selection of the most interesting proteins for further studies. The basis for elaborating a new approach to sequence analysis is the hypothesis that when using conserved peptide patterns to determine the similarities between proteins, the exact spacing between conserved adjacent amino acids in the proteins plays a prominent functional role. Thus, the objective of developing the method of conserved unique peptide patterns (CUPP) is to construct a peptide-based grouping and validate the method to provide evidence that CUPP captures function-related features of the individual carbohydrate-active enzymes (as defined by CAZy families). This approach facilitates grouping of enzymes at a level lower than protein families and/or subfamilies. A standardized, efficient, and robust approach to functional annotation of carbohydrate-active enzymes would support improved molecular insight into enzyme-substrate interaction.ResultsA new nonalignment-based clustering and functional annotation tool was developed that uses conserved unique peptides patterns to perform automated clustering of proteins and formation of protein groups. A peptide-based model was constructed for each of these protein CUPP groups to be used to automatically annotate protein family, subfamily, and EC function of carbohydrate-active enzymes. CUPP prediction can annotate proteins (from any CAZy family) with high F-score to existing family (0.966), subfamily (0.961), and EC-function (0.843). The speed of the CUPP program was estimated and exemplified by prediction of the 504,017 nonredundant proteins of CAZy in less than four CPU hours.ConclusionIt was possible to construct an automated system for clustering proteins within families and use the resulting CUPP groups to directly build peptide-based models for genome annotation. The CUPP runtime, F-score, sensitivity, and precisions of family and subfamily annotations match or represent an improvement compared to state-of-the-art tools. The speed of the CUPP annotation is similar to the rapid DIAMOND annotation tool. CUPP facilitates automated annotation of full genome assemblies to any CAZy family.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Functional Carbohydrate-Active Enzymes Acquired by Horizontal Gene Transfer from Plants in the Whitefly Bemisia tabaci
    Colinet, Dominique
    Haon, Mireille
    Drula, Elodie
    Boyer, Mathilde
    Grisel, Sacha
    Belliardo, Carole
    Koutsovoulos, Georgios D.
    Berrin, Jean-Guy
    Danchin, Etienne G. J.
    GENOME BIOLOGY AND EVOLUTION, 2025, 17 (02):
  • [42] A New Versatile Microarray-based Method for High Throughput Screening of Carbohydrate-active Enzymes
    Vidal-Melgosa, Silvia
    Pedersen, Henriette L.
    Schuckel, Julia
    Arnal, Gregory
    Dumon, Claire
    Amby, Daniel B.
    Monrad, Rune Nygaard
    Westereng, Bjorge
    Willats, William G. T.
    JOURNAL OF BIOLOGICAL CHEMISTRY, 2015, 290 (14) : 9020 - 9036
  • [43] Conserved Binding Regions Provide the Clue for Peptide-Based Vaccine Development: A Chemical Perspective
    Curtidor, Hernando
    Reyes, Cesar
    Bermudez, Adriana
    Vanegas, Magnolia
    Varela, Yahson
    Patarroyo, Manuel E.
    MOLECULES, 2017, 22 (12):
  • [44] Functional Integration of DNA and Peptide-Based Supramolecular Nanoassemblies for Cancer Therapy
    Dong, Yuhang
    Guo, Yunhua
    Song, Wenzhe
    Nie, Guangjun
    Li, Feng
    ACCOUNTS OF MATERIALS RESEARCH, 2023, 4 (10): : 892 - 905
  • [45] Hydrophobic nanofibers: a peptide-based functional anti-fouling material
    Hati, Kshitish Chandra
    Kumar, Santosh
    Mondal, Sahabaj
    Singh, Surajit
    Shit, Ananda
    Nandi, Sujay Kumar
    Haldar, Debasish
    MATERIALS ADVANCES, 2022, 3 (10): : 4194 - 4199
  • [46] Peptide-Based Ligand Screening and Functional Analysis of Protein Kinase C
    Ohashi, Nami
    Nomura, Wataru
    Narumi, Tetsuo
    Tamamura, Hirokazu
    BIOPOLYMERS, 2013, 100 (06) : 613 - 620
  • [47] Rational design of peptide-based functional biomaterials via multiscale modeling
    Hung Nguyen
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [48] Nanotechnology Meets Biology: Peptide-based Methods for the Fabrication of Functional Materials
    Briggs, Beverly D.
    Knecht, Marc R.
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2012, 3 (03): : 405 - 418
  • [49] Transcriptome Profiling-Based Analysis of Carbohydrate-Active Enzymes inAspergillus terreusInvolved in Plant Biomass Degradation
    Correa, Camila L.
    Midorikawa, Glaucia E. O.
    Ferreira Filho, Edivaldo Ximenes
    Noronha, Eliane Ferreira
    Alves, Gabriel S. C.
    Togawa, Roberto Coiti
    Silva, Orzenil Bonfim, Jr.
    do Carmo Costa, Marcos Mota
    Grynberg, Priscila
    Miller, Robert N. G.
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 8 (08):
  • [50] Direct Structural Annotation of Membrane Protein Aggregation Loci using Peptide-Based Reverse Mapping
    Lella, Muralikrishna
    Mahalakshmi, Radhakrishnan
    JOURNAL OF PHYSICAL CHEMISTRY LETTERS, 2018, 9 (11): : 2967 - 2971