Structure-aware annotation of leucine-rich repeat domains

被引:0
|
作者
Xu, Boyan [1 ,2 ]
Cerbu, Alois [2 ]
Tralie, Christopher J. [3 ]
Lim, Daven [4 ]
Krasileva, Ksenia [1 ,4 ]
机构
[1] Univ Calif Berkeley, Ctr Computat Biol, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Math, Berkeley, CA USA
[3] Ursinus Coll, Dept Math & Comp Sci, Collegeville, PA USA
[4] Univ Calif Berkeley, Dept Plant & Microbial Biol, Berkeley, CA 94720 USA
关键词
CURVATURE; FAMILIES;
D O I
10.1371/journal.pcbi.1012526
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein domain annotation is typically done by predictive models such as HMMs trained on sequence motifs. However, sequence-based annotation methods are prone to error, particularly in calling domain boundaries and motifs within them. These methods are limited by a lack of structural information accessible to the model. With the advent of deep learning-based protein structure prediction, existing sequenced-based domain annotation methods can be improved by taking into account the geometry of protein structures. We develop dimensionality reduction methods to annotate repeat units of the Leucine Rich Repeat solenoid domain. The methods are able to correct mistakes made by existing machine learning-based annotation tools and enable the automated detection of hairpin loops and structural anomalies in the solenoid. The methods are applied to 127 predicted structures of LRR-containing intracellular innate immune proteins in the model plant Arabidopsis thaliana and validated against a benchmark dataset of 172 manually-annotated LRR domains.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Leucine-Rich Repeat Receptor-Like Proteins in Plants: Structure, Function, and Signaling
    Cho, Hyewon
    Lee, Jungmo
    Oh, Eunkyoo
    JOURNAL OF PLANT BIOLOGY, 2023, 66 (02) : 99 - 107
  • [32] Sequence-structure-function relations of the mosquito leucine-rich repeat immune proteins
    Robert M Waterhouse
    Michael Povelones
    George K Christophides
    BMC Genomics, 11
  • [33] THE PRIMARY STRUCTURE OF A BASIC LEUCINE-RICH REPEAT PROTEIN, PRELP, FOUND IN CONNECTIVE TISSUES
    BENGTSSON, E
    NEAME, PJ
    HEINEGARD, D
    SOMMARIN, Y
    JOURNAL OF BIOLOGICAL CHEMISTRY, 1995, 270 (43) : 25639 - 25644
  • [34] Sequence-structure-function relations of the mosquito leucine-rich repeat immune proteins
    Waterhouse, Robert M.
    Povelones, Michael
    Christophides, George K.
    BMC GENOMICS, 2010, 11
  • [35] A conserved gene family encodes transmembrane proteins with fibronectin, immunoglobulin and leucine-rich repeat domains (FIGLER)
    Munfus, Delicia L.
    Haga, Christopher L.
    Burrows, Peter D.
    Cooper, Max D.
    BMC BIOLOGY, 2007, 5 (1)
  • [36] A conserved gene family encodes transmembrane proteins with fibronectin, immunoglobulin and leucine-rich repeat domains (FIGLER)
    Delicia L Munfus
    Christopher L Haga
    Peter D Burrows
    Max D Cooper
    BMC Biology, 5
  • [37] Improved annotation of the nucleotide-binding domain leucine-rich repeat containing NLR gene family in citrus
    Zheng, Qiaolin
    Pitino, Marco
    Saha, Surya
    Flores, Mirella
    Hosmani, Prashant
    Mueller, Lukas
    Cano, Liliana M.
    PHYTOPATHOLOGY, 2019, 109 (09)
  • [38] Structure and regulation of full-length human leucine-rich repeat kinase 1
    Riley D. Metcalfe
    Juliana A. Martinez Fiesco
    Luis Bonet-Ponce
    Jillian H. Kluss
    Mark R. Cookson
    Ping Zhang
    Nature Communications, 14
  • [39] Leucine-Rich Repeat Genes and the Fine-Tuning of Synapses
    Francks, Clyde
    BIOLOGICAL PSYCHIATRY, 2011, 69 (09) : 820 - 821
  • [40] Identification of epitopes in Leptospira borgpetersenii leucine-rich repeat proteins
    Nitipan, Supachai
    Sritrakul, Tepyuda
    Kunjantarachot, Anthicha
    Prapong, Siriwan
    INFECTION GENETICS AND EVOLUTION, 2013, 14 : 46 - 57