The CRISPR Spacer Space Is Dominated by Sequences from Species-Specific Mobilomes

被引:143
|
作者
Shmakov, Sergey A. [1 ,2 ]
Sitnik, Vassilii [1 ]
Makarova, Kira S. [2 ]
Wolf, Yuri I. [2 ]
Severinov, Konstantin V. [1 ,3 ,4 ]
Koonin, Eugene V. [2 ]
机构
[1] Skolkovo Inst Sci & Technol, Skolkovo, Russia
[2] Natl Lib Med, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
[3] State Univ New Jersey, Waksman Inst Microbiol Rutgers, Piscataway, NJ USA
[4] Russian Acad Sci, Inst Mol Genet, Moscow, Russia
来源
MBIO | 2017年 / 8卷 / 05期
关键词
CRISPR-Cas; bacteriophages; mobilome; oligonucleotide composition; spacer acquisition; ACQUIRED-RESISTANCE; TARGET RECOGNITION; IMMUNE-SYSTEMS; CAS SYSTEMS; RNA; DNA; GUIDE; DYNAMICS; ACQUISITION; ADAPTATION;
D O I
10.1128/mBio.01397-17
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) systems store the memory of past encounters with foreign DNA in unique spacers that are inserted between direct repeats in CRISPR arrays. For only a small fraction of the spacers, homologous sequences, called proto-spacers, are detectable in viral, plasmid, and microbial genomes. The rest of the spacers remain the CRISPR "dark matter." We performed a comprehensive analysis of the spacers from all CRISPR-cas loci identified in bacterial and archaeal genomes, and we found that, depending on the CRISPR-Cas subtype and the prokaryotic phylum, protospacers were detectable for 1% to about 19% of the spacers (similar to 7% global average). Among the detected protospacers, the majority, typically 80 to 90%, originated from viral genomes, including proviruses, and among the rest, the most common source was genes that are integrated into microbial chromosomes but are involved in plasmid conjugation or replication. Thus, almost all spacers with identifiable protospacers target mobile genetic elements (MGE). The GC content, as well as dinucleotide and tetranucleotide compositions, of microbial genomes, their spacer complements, and the cognate viral genomes showed a nearly perfect correlation and were almost identical. Given the near absence of self-targeting spacers, these findings are most compatible with the possibility that the spacers, including the dark matter, are derived almost completely from the species-specific microbial mobilomes. IMPORTANCE The principal function of CRISPR-Cas systems is thought to be protection of bacteria and archaea against viruses and other parasitic genetic elements. The CRISPR defense function is mediated by sequences from parasitic elements, known as spacers, that are inserted into CRISPR arrays and then transcribed and employed as guides to identify and inactivate the cognate parasitic genomes. However, only a small fraction of the CRISPR spacers match any sequences in the current databases, and of these, only a minority correspond to known parasitic elements. We show that nearly all spacers with matches originate from viral or plasmid genomes that are either free or have been integrated into the host genome. We further demonstrate that spacers with no matches have the same properties as those of identifiable origins, strongly suggesting that all spacers originate from mobile elements.
引用
收藏
页数:18
相关论文
共 50 条
  • [2] SPECIES-SPECIFIC REPEATED DNA-SEQUENCES FROM PETUNIA
    SHEPHERD, AL
    ANDERSON, S
    SMITH, SM
    PLANT SCIENCE, 1990, 67 (01) : 57 - 62
  • [3] Species-specific genomic sequences for classification of bacteria
    Paul, Bobby
    Raj, K. Kavia
    Murali, Thokur Sreepathy
    Satyamoorthy, K.
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123
  • [4] Variation and constraints in species-specific promoter sequences
    Calistri, Elisa
    Buiatti, Marcello
    Livi, Roberto
    JOURNAL OF THEORETICAL BIOLOGY, 2014, 363 : 357 - 366
  • [5] SPECIES-SPECIFIC DNA-SEQUENCES IN THE TRITICEAE
    ANAMTHAWATJONSSON, K
    HESLOPHARRISON, JS
    HEREDITAS, 1992, 116 (1-2): : 49 - 54
  • [6] Identification of the shortest species-specific oligonucleotide sequences
    Mouratidis, Ioannis
    Konnaris, Maxwell A.
    Chantzi, Nikol
    Chan, Candace S. Y.
    Patsakis, Michail
    Provatas, Kimonas
    Montgomery, Austin
    Baltoumas, Fotis A.
    Sha, Congzhou M.
    Mareboina, Manvita
    Pavlopoulos, Georgios A.
    Chartoumpekis, Dionysios V.
    Georgakopoulos-Soares, Ilias
    GENOME RESEARCH, 2025, 35 (02) : 279 - 295
  • [7] Species-specific accumulation of interspersed sequences in genus Saccharum
    Nakayama, S
    GENES & GENETIC SYSTEMS, 2004, 79 (06) : 361 - 365
  • [8] An examination of species-specific growing space utilization
    Lhotka, John M.
    Loewenstein, Edward F.
    CANADIAN JOURNAL OF FOREST RESEARCH, 2008, 38 (03) : 470 - 479
  • [9] In Silico Processing of the Complete CRISPR-Cas Spacer Space for Identification of PAM Sequences
    Mendoza, Brian J.
    Trinh, Cong T.
    BIOTECHNOLOGY JOURNAL, 2018, 13 (09)
  • [10] CRISPRs of Enterococcus faecalis and E. hirae Isolates from Pig Feces Have Species-Specific Repeats But Share Some Common Spacer Sequences
    Isha Katyal
    Bonnie Chaban
    Beata Ng
    Janet E. Hill
    Microbial Ecology, 2013, 66 : 182 - 188