Genome mining for anti-CRISPR operons using machine learning

被引:0
|
作者
Yang, Bowen [1 ]
Khatri, Minal [2 ]
Zheng, Jinfang [1 ]
Deogun, Jitender [2 ]
Yin, Yanbin [1 ]
机构
[1] Univ Nebraska Lincoln, Nebraska Food Hlth Ctr, Dept Food Sci & Technol, Lincoln, NE 68508 USA
[2] Univ Nebraska, Sch Comp, Lincoln, NE 68588 USA
基金
美国国家卫生研究院; 美国农业部;
关键词
D O I
10.1093/bioinformatics/btad309
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Encoded by (pro-)viruses, anti-CRISPR (Acr) proteins inhibit the CRISPR-Cas immune system of their prokaryotic hosts. As a result, Acr proteins can be employed to develop more controllable CRISPR-Cas genome editing tools. Recent studies revealed that known acr genes often coexist with other acr genes and with phage structural genes within the same operon. For example, we found that 47 of 98 known acr genes (or their homologs) co-exist in the same operons. None of the current Acr prediction tools have considered this important genomic context feature. We have developed a new software tool AOminer to facilitate the improved discovery of new Acrs by fully exploiting the genomic context of known acr genes and their homologs.Results: AOminer is the first machine learning based tool focused on the discovery of Acr operons (AOs). A two-state HMM (hidden Markov model) was trained to learn the conserved genomic context of operons that contain known acr genes or their homologs, and the learnt features could distinguish AOs and non-AOs. AOminer allows automated mining for potential AOs from query genomes or operons. AOminer outperformed all existing Acr prediction tools with an accuracy = 0.85. AOminer will facilitate the discovery of novel anti-CRISPR operons.
引用
收藏
页数:3
相关论文
共 50 条
  • [1] AcrFinder: genome mining anti-CRISPR operons in prokaryotes and their viruses
    Yi, Haidong
    Huang, Le
    Yang, Bowen
    Gomez, Javi
    Zhang, Han
    Yin, Yanbin
    [J]. NUCLEIC ACIDS RESEARCH, 2020, 48 (W1) : W358 - W365
  • [2] AcrDB: a database of anti-CRISPR operons in prokaryotes and viruses
    Huang, Le
    Yang, Bowen
    Yi, Haidong
    Asif, Amina
    Wang, Jiawei
    Lithgow, Trevor
    Zhang, Han
    Minhas, Fayyaz ul Amir Afsar
    Yin, Yanbin
    [J]. NUCLEIC ACIDS RESEARCH, 2021, 49 (D1) : D622 - D629
  • [3] Machine learning predicts new anti-CRISPR proteins
    Eitzinger, Simon
    Asif, Amina
    Watters, Kyle E.
    Iavarone, Anthony T.
    Knott, Gavin J.
    Doudna, Jennifer A.
    Minhas, Fayyaz ul Amir Afsar
    [J]. NUCLEIC ACIDS RESEARCH, 2020, 48 (09) : 4698 - 4708
  • [4] PreAcrs: a machine learning framework for identifying anti-CRISPR proteins
    Lin Zhu
    Xiaoyu Wang
    Fuyi Li
    Jiangning Song
    [J]. BMC Bioinformatics, 23
  • [5] PreAcrs: a machine learning framework for identifying anti-CRISPR proteins
    Zhu, Lin
    Wang, Xiaoyu
    Li, Fuyi
    Song, Jiangning
    [J]. BMC BIOINFORMATICS, 2022, 23 (01)
  • [6] Prediction of protein–protein interactions between anti-CRISPR and CRISPR-Cas using machine learning technique
    Sneha Murmu
    Himanshushekhar Chaurasia
    Sayanti Guha Majumdar
    A. R. Rao
    Anil Rai
    Sunil Archak
    [J]. Journal of Plant Biochemistry and Biotechnology, 2023, 32 : 818 - 830
  • [7] Prediction of protein-protein interactions between anti-CRISPR and CRISPR-Cas using machine learning technique
    Murmu, Sneha
    Chaurasia, Himanshushekhar
    Majumdar, Sayanti Guha
    Rao, A. R.
    Rai, Anil
    Archak, Sunil
    [J]. JOURNAL OF PLANT BIOCHEMISTRY AND BIOTECHNOLOGY, 2023, 32 (04) : 818 - 830
  • [8] Machine-learning approach expands the repertoire of anti-CRISPR protein families
    Ayal B. Gussow
    Allyson E. Park
    Adair L. Borges
    Sergey A. Shmakov
    Kira S. Makarova
    Yuri I. Wolf
    Joseph Bondy-Denomy
    Eugene V. Koonin
    [J]. Nature Communications, 11
  • [9] Machine-learning approach expands the repertoire of anti-CRISPR protein families
    Gussow, Ayal B.
    Park, Allyson E.
    Borges, Adair L.
    Shmakov, Sergey A.
    Makarova, Kira S.
    Wolf, Yuri I.
    Bondy-Denomy, Joseph
    Koonin, Eugene V.
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [10] Structural Insights into Crispr and Anti-Crispr Systems for Precise Genome Editing
    Jiang, Fuguo
    Liu, Junjie
    Nogales, Eva
    Bondy-Denomy, Joseph
    Doudna, Jennifer
    [J]. PROTEIN SCIENCE, 2018, 27 : 55 - 56