Interpretable CRISPR/Cas9 off-target activities with mismatches and indels prediction using BERT

被引:2
|
作者
Luo, Ye [1 ]
Chen, Yaowen [1 ]
Xie, HuanZeng [1 ]
Zhu, Wentao [1 ]
Zhang, Guishan [1 ]
机构
[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China
基金
中国国家自然科学基金;
关键词
CRISPER/Cas9; Off-target; BERT; Adaptive batch-wise olass balancing; Deep learning; GENOME EDITING TECHNOLOGIES; CLASSIFICATION; CRISPR-CAS9; SPECIFICITY; DESIGN; CAS9; SYSTEMS; DNA;
D O I
10.1016/j.compbiomed.2024.107932
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Off-target effects of CRISPR/Cas9 can lead to suboptimal genome editing outcomes. Numerous deep learning-based approaches have achieved excellent performance for off-target prediction; however, few can predict the off-target activities with both mismatches and indels between single guide RNA (sgRNA) and target DNA sequence pair. In addition, data imbalance is a common pitfall for off-target prediction. Moreover, due to the complexity of genomic contexts, generating an interpretable model also remains challenged. To address these issues, firstly we developed a BERT-based model called CRISPR-BERT for enhancing the prediction of off-target activities with both mismatches and indels. Secondly, we proposed an adaptive batch-wise class balancing strategy to combat the noise exists in imbalanced off-target data. Finally, we applied a visualization approach for investigating the generalizable nucleotide position-dependent patterns of sgRNA-DNA pair for off-target activity. In our comprehensive comparison to existing methods on five mismatches-only datasets and two mismatches-and-indels datasets, CRISPR-BERT achieved the best performance in terms of AUROC and PRAUC. Besides, the visualization analysis demonstrated how implicit knowledge learned by CRISPR-BERT facilitates off-target prediction, which shows potential in model interpretability. Collectively, CRISPR-BERT provides an accurate and interpretable framework for off-target prediction, further contributes to sgRNA optimization in practical use for improved target specificity in CRISPR/Cas9 genome editing. The source code is available at https://github.com/BrokenStringx/CRISPR-BERT
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Off-target evaluation of LTR targeted anti-HIV CRISPR/Cas9 therapy
    Link, Robert
    Nonnemacher, Michael
    Wigdahl, Brian
    Dampier, Will
    JOURNAL OF NEUROVIROLOGY, 2018, 24 : S49 - S49
  • [32] Current Bioinformatics Tools to Optimize CRISPR/Cas9 Experiments to Reduce Off-Target Effects
    Naeem, Muhammad
    Alkhnbashi, Omer S.
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (07)
  • [33] Off-target evaluation of LTR targeted anti-HIV CRISPR/Cas9 therapy
    Link, Robert
    Nonnemacher, Michael
    Wigdahl, Brian
    Dampier, Will
    JOURNAL OF NEUROIMMUNE PHARMACOLOGY, 2018, 13 : S49 - S49
  • [34] Cas9 loosens its grip on off-target sites
    Christopher E Nelson
    Charles A Gersbach
    Nature Biotechnology, 2016, 34 : 299 - 299
  • [35] DNA stretching induces Cas9 off-target activity
    Newton, Matthew D.
    Taylor, Benjamin J.
    Driessen, Rosalie P. C.
    Roos, Leonie
    Cvetesic, Nevena
    Allyjaun, Shenaz
    Lenhard, Boris
    Cuomo, Maria Emanuela
    Rueda, David S.
    NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2019, 26 (03) : 185 - +
  • [36] CRISPR/CAS9 Target Prediction with Deep Learning
    Aktas, Ozlem
    Dogan, Elif
    Ensari, Tolga
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [37] DNA stretching induces Cas9 off-target activity
    Matthew D. Newton
    Benjamin J. Taylor
    Rosalie P. C. Driessen
    Leonie Roos
    Nevena Cvetesic
    Shenaz Allyjaun
    Boris Lenhard
    Maria Emanuela Cuomo
    David S. Rueda
    Nature Structural & Molecular Biology, 2019, 26 : 185 - 192
  • [38] Cas9 loosens its grip on off-target sites
    Nelson, Christopher E.
    Gersbach, Charles A.
    NATURE BIOTECHNOLOGY, 2016, 34 (03) : 298 - 299
  • [39] Transformer-based anti-noise models for CRISPR-Cas9 off-target activities prediction
    Guan, Zengrui
    Jiang, Zhenran
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
  • [40] Validation of CRISPR/Cas9 Off-Target Discovery Profiles from In Silico Prediction, Cell-Based & Biochemical-Based Assays with Targeted Off-Target Sequencing
    Patel, Nishit
    MOLECULAR THERAPY, 2020, 28 (04) : 99 - 99