Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins

被引:87
|
作者
Simon, Michelle [1 ]
Hancock, John M. [1 ]
机构
[1] MRC Harwell, Bioinformat Grp, Mammalian Genet Unit, Harwell OX11 ORD, Oxon, England
来源
GENOME BIOLOGY | 2009年 / 10卷 / 06期
基金
英国医学研究理事会;
关键词
INTRINSICALLY UNSTRUCTURED PROTEINS; SIMPLE SEQUENCE REPEATS; TRINUCLEOTIDE REPEATS; COMPARATIVE GENOMICS; POLYGLUTAMINE TRACT; NETWORK EVOLUTION; DOMAINS; DISEASE; LONG; GENE;
D O I
10.1186/gb-2009-10-6-r59
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Amino acid repeats (AARs) are common features of protein sequences. They often evolve rapidly and are involved in a number of human diseases. They also show significant associations with particular Gene Ontology (GO) functional categories, particularly transcription, suggesting they play some role in protein function. It has been suggested recently that AARs play a significant role in the evolution of intrinsically unstructured regions (IURs) of proteins. We investigate the relationship between AAR frequency and evolution and their localization within proteins based on a set of 5,815 orthologous proteins from four mammalian (human, chimpanzee, mouse and rat) and a bird (chicken) genome. We consider two classes of AAR (tandem repeats and cryptic repeats: regions of proteins containing overrepresentations of short amino acid repeats). Results: Mammals show very similar repeat frequencies but chicken shows lower frequencies of many of the cryptic repeats common in mammals. Regions flanking tandem AARs evolve more rapidly than the rest of the protein containing the repeat and this phenomenon is more pronounced for non-conserved repeats than for conserved ones. GO associations are similar to those previously described for the mammals, but chicken cryptic repeats show fewer significant associations. Comparing the overlaps of AARs with IURs and protein domains showed that up to 96% of some AAR types are associated preferentially with IURs. However, no more than 15% of IURs contained an AAR. Conclusions: Their location within IURs explains many of the evolutionary properties of AARs. Further study is needed on the types of IURs containing AARs.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Tandem and cryptic amino acid repeats accumulate in disordered regions of proteins
    Michelle Simon
    John M Hancock
    [J]. Genome Biology, 10
  • [2] ProRepeat: an integrated repository for studying amino acid tandem repeats in proteins
    Luo, Hong
    Lin, Ke
    David, Audrey
    Nijveen, Harm
    Leunissen, Jack A. M.
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D394 - D399
  • [3] Identifying disordered regions in proteins from amino acid sequence
    Romero, P
    Obradovic, Z
    Kissinger, C
    Villafranca, JE
    Dunker, AK
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS 1-4, 1997, : 90 - 95
  • [4] Biased Distribution of Amino Acid in Intrinsically Disordered Proteins and Regions
    Ding, Zhengyu
    Feng, Tian
    Nan, Fangbo
    Wang, Yu
    He, Bo
    [J]. PROCEEDINGS OF 2018 6TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (ICBCB 2018), 2018, : 28 - 31
  • [5] Natural selection drives the accumulation of amino acid tandem repeats in human proteins
    Mularoni, Loris
    Ledda, Alice
    Toll-Riera, Macarena
    Mar Alba, M.
    [J]. GENOME RESEARCH, 2010, 20 (06) : 745 - 754
  • [6] Predicting disordered regions in proteins using the profiles of amino acid indices
    Han, Pengfei
    Zhang, Xiuzhen
    Feng, Zhi-Ping
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [7] Predicting disordered regions in proteins using the profiles of amino acid indices
    Pengfei Han
    Xiuzhen Zhang
    Zhi-Ping Feng
    [J]. BMC Bioinformatics, 10
  • [8] Highly constrained proteins contain an unexpectedly large number of amino acid tandem repeats
    Mularoni, Loris
    Veitia, Reiner A.
    Alba, M. Mar
    [J]. GENOMICS, 2007, 89 (03) : 316 - 325
  • [9] Markov Models of Amino Acid Substitution to Study Proteins with Intrinsically Disordered Regions
    Szalkowski, Adam M.
    Anisimova, Maria
    [J]. PLOS ONE, 2011, 6 (05):
  • [10] Amino acid substitution scoring matrices specific to intrinsically disordered regions in proteins
    Trivedi, Rakesh
    Nagarajaram, Hampapathalu Adimurthy
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)