Novel function discovery through sequence and structural data mining

被引:26
|
作者
Lobb, Briallen [1 ]
Doxey, Andrew C. [1 ]
机构
[1] Univ Waterloo, Dept Biol, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
PROTEIN-PROTEIN INTERACTIONS; LARGE-SCALE; LINEAR MOTIFS; COMPLETE NITRIFICATION; STRUCTURE PREDICTION; ENZYME; EVOLUTION; SPECIFICITY; BACTERIA; SURFACE;
D O I
10.1016/j.sbi.2016.05.017
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Large-scale sequence and structural data is a goldmine of novel proteins, but how can this data be effectively mined for new functions? Here, we review protein function prediction methods and recent studies that apply these methods to discover new functionality. Core approaches include sequence-based homology detection, phylogenetic analysis, structural bioinformatics, and inference of functional associations using genomic context and related methods. With such a wide range of approaches, sequences may reveal new functionality regardless of their similarity to a characterized reference. Homologs of known function may be identified in unexpected species or associations. Detection of functional shifts in sequences may reveal new activities and specificities. New protein functions may also be predicted in uncharacterized sequences and structures. Finally, methods and data may be integrated and applied at increasingly large scales due to improved protein domain knowledge and structural coverage, which amplifies the ability to predict and discover novel protein functions.
引用
收藏
页码:53 / 61
页数:9
相关论文
共 50 条
  • [1] Linkage Discovery Through Data Mining
    Ting, Chuan-Kang
    Zeng, Wei-Ming
    Lin, Tzu-Chieh
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (01) : 10 - 13
  • [2] A semiautomated approach to gene discovery through expressed sequence tag data mining: Discovery of new human transporter genes
    Shoshana Brown
    Jean l. Chang
    Wolfgang Sadee
    Patricia C. Babbitt
    AAPS PharmSci, 5
  • [3] A semiautomated approach to gene discovery through expressed sequence tag data mining: Discovery of new human transporter genes
    Brown, S
    Chang, JL
    Sadee, W
    Babbitt, PC
    AAPS PHARMSCI, 2003, 5 (01):
  • [4] A novel methodology for knowledge discovery through mining associations between building operational data
    Yu, Zhun
    Haghighat, Fariborz
    Fung, Benjamin C. M.
    Zhou, Liang
    ENERGY AND BUILDINGS, 2012, 47 : 430 - 440
  • [5] DATA SCIENCE AND KNOWLEDGE DISCOVERY THROUGH DATA MINING PARADIGMS
    Chhabra, Indu
    Suri, Gunmala
    JOURNAL OF MECHANICS OF CONTINUA AND MATHEMATICAL SCIENCES, 2019, 14 (02): : 167 - 173
  • [6] Data mining: Efficiency of using sequence databases for polymorphism discovery
    Cox, DG
    Boillot, C
    Canzian, F
    HUMAN MUTATION, 2001, 17 (02) : 141 - 150
  • [7] The discovery of patterns through data mining at IPTU in Curitiba
    Costa, Ana Paula
    Pecini, Andre Custodio
    Tsunoda, Denise Fukumi
    REVISTA DO SERVICO PUBLICO, 2021, 72 (04): : 753 - 778
  • [8] Tourism Knowledge Discovery through Data Mining Techniques
    Jamil, Jastini Mohd
    Shaharanee, Izwan Nizal Mohd
    4TH INNOVATION AND ANALYTICS CONFERENCE & EXHIBITION (IACE 2019), 2019, 2138
  • [9] Knowledge discovery through mining process operational data
    Wang, XZ
    APPLICATION OF NEURAL NETWORKS AND OTHER LEARNING TECHNOLOGIES IN PROCESS ENGINEERING, 2001, : 287 - 328
  • [10] Structural and functional analysis of L-methionine oxidase identified through sequence data mining
    Kawamura, Yui
    Sugiura, Sayaka
    Araseki, Hayato
    Chisuga, Taichi
    Nakano, Shogo
    JOURNAL OF BIOSCIENCE AND BIOENGINEERING, 2024, 138 (05) : 391 - 398