The CRISPR Spacer Space Is Dominated by Sequences from Species-Specific Mobilomes

被引:143
|
作者
Shmakov, Sergey A. [1 ,2 ]
Sitnik, Vassilii [1 ]
Makarova, Kira S. [2 ]
Wolf, Yuri I. [2 ]
Severinov, Konstantin V. [1 ,3 ,4 ]
Koonin, Eugene V. [2 ]
机构
[1] Skolkovo Inst Sci & Technol, Skolkovo, Russia
[2] Natl Lib Med, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
[3] State Univ New Jersey, Waksman Inst Microbiol Rutgers, Piscataway, NJ USA
[4] Russian Acad Sci, Inst Mol Genet, Moscow, Russia
来源
MBIO | 2017年 / 8卷 / 05期
关键词
CRISPR-Cas; bacteriophages; mobilome; oligonucleotide composition; spacer acquisition; ACQUIRED-RESISTANCE; TARGET RECOGNITION; IMMUNE-SYSTEMS; CAS SYSTEMS; RNA; DNA; GUIDE; DYNAMICS; ACQUISITION; ADAPTATION;
D O I
10.1128/mBio.01397-17
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) systems store the memory of past encounters with foreign DNA in unique spacers that are inserted between direct repeats in CRISPR arrays. For only a small fraction of the spacers, homologous sequences, called proto-spacers, are detectable in viral, plasmid, and microbial genomes. The rest of the spacers remain the CRISPR "dark matter." We performed a comprehensive analysis of the spacers from all CRISPR-cas loci identified in bacterial and archaeal genomes, and we found that, depending on the CRISPR-Cas subtype and the prokaryotic phylum, protospacers were detectable for 1% to about 19% of the spacers (similar to 7% global average). Among the detected protospacers, the majority, typically 80 to 90%, originated from viral genomes, including proviruses, and among the rest, the most common source was genes that are integrated into microbial chromosomes but are involved in plasmid conjugation or replication. Thus, almost all spacers with identifiable protospacers target mobile genetic elements (MGE). The GC content, as well as dinucleotide and tetranucleotide compositions, of microbial genomes, their spacer complements, and the cognate viral genomes showed a nearly perfect correlation and were almost identical. Given the near absence of self-targeting spacers, these findings are most compatible with the possibility that the spacers, including the dark matter, are derived almost completely from the species-specific microbial mobilomes. IMPORTANCE The principal function of CRISPR-Cas systems is thought to be protection of bacteria and archaea against viruses and other parasitic genetic elements. The CRISPR defense function is mediated by sequences from parasitic elements, known as spacers, that are inserted into CRISPR arrays and then transcribed and employed as guides to identify and inactivate the cognate parasitic genomes. However, only a small fraction of the CRISPR spacers match any sequences in the current databases, and of these, only a minority correspond to known parasitic elements. We show that nearly all spacers with matches originate from viral or plasmid genomes that are either free or have been integrated into the host genome. We further demonstrate that spacers with no matches have the same properties as those of identifiable origins, strongly suggesting that all spacers originate from mobile elements.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Species-specific sequences at the omp2 locus of Brucella type strains
    Ficht, TA
    Husseinen, HS
    Derr, J
    Bearden, SW
    INTERNATIONAL JOURNAL OF SYSTEMATIC BACTERIOLOGY, 1996, 46 (01): : 329 - 331
  • [32] Species-specific primers in multiplex PCR for Bactrocera minax identification using an internal transcribed spacer
    Regmi, Prakriti
    Tsai, Cheng-Lung
    Lin, Ming-Ying
    Chuang, Yi-Yuan
    Yeh, Wen-Bin
    JOURNAL OF ASIA-PACIFIC ENTOMOLOGY, 2023, 26 (04)
  • [33] Species-specific repeat units in the intergenic spacer of the ribosomal RNA cistron of Anopheles aquasalis curry
    Perera, OP
    Cockburn, AF
    Mitchell, SE
    Conn, J
    Seawright, JA
    AMERICAN JOURNAL OF TROPICAL MEDICINE AND HYGIENE, 1998, 59 (05): : 673 - 678
  • [34] Molecular screening of xenodonor genomes for species-specific endogenous retroviral DNA sequences
    Hoopes, CW
    Platt, JL
    TRANSPLANTATION PROCEEDINGS, 1997, 29 (1-2) : 897 - 898
  • [35] Species-diagnostic and species-specific DNA sequences evenly distributed throughout pine and spruce chromosomes
    Mehes-Smith, Melanie
    Michael, Paul
    Nkongolo, Kabwe
    GENOME, 2010, 53 (10) : 769 - 777
  • [36] Species-Specific Actions of Incretin: From the Evolutionary Perspective
    Kawasaki, Yukiko
    Hamamoto, Yoshiyuki
    Koshiyama, Hiroyuki
    JAPANESE CLINICAL MEDICINE, 2010, 1 : 5 - 11
  • [37] Physical localisation of repetitive DNA sequences in Alstroemeria: karyotyping of two species with species-specific and ribosomal DNA
    Kamstra, SA
    Kuipers, AGJ
    DeJeu, MJ
    Ramanna, MS
    Jacobsen, E
    GENOME, 1997, 40 (05) : 652 - 658
  • [38] Morphologies and RUBISCO spacer sequences in the Pelvetia species from the northeast Pacific
    Lee, Yun Kyung
    Yoon, Hwan Su
    Kim, Young Jin
    Motomura, Taizo
    Boo, Sung Min
    PHYCOLOGIA, 1997, 36 (04) : 59 - 60
  • [39] Identification of differential abundance of satellite DNA sequences in Asclepias (Apocynaceae): in-depth characterization of species-specific sequences
    Graziele Clemente Costa
    Cicero Almeida
    Plant Systematics and Evolution, 2022, 308
  • [40] Identification of differential abundance of satellite DNA sequences in Asclepias (Apocynaceae): in-depth characterization of species-specific sequences
    Costa, Graziele Clemente
    Almeida, Cicero
    PLANT SYSTEMATICS AND EVOLUTION, 2022, 308 (06)