Interpretable CRISPR/Cas9 off-target activities with mismatches and indels prediction using BERT

被引:2
|
作者
Luo, Ye [1 ]
Chen, Yaowen [1 ]
Xie, HuanZeng [1 ]
Zhu, Wentao [1 ]
Zhang, Guishan [1 ]
机构
[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China
基金
中国国家自然科学基金;
关键词
CRISPER/Cas9; Off-target; BERT; Adaptive batch-wise olass balancing; Deep learning; GENOME EDITING TECHNOLOGIES; CLASSIFICATION; CRISPR-CAS9; SPECIFICITY; DESIGN; CAS9; SYSTEMS; DNA;
D O I
10.1016/j.compbiomed.2024.107932
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Off-target effects of CRISPR/Cas9 can lead to suboptimal genome editing outcomes. Numerous deep learning-based approaches have achieved excellent performance for off-target prediction; however, few can predict the off-target activities with both mismatches and indels between single guide RNA (sgRNA) and target DNA sequence pair. In addition, data imbalance is a common pitfall for off-target prediction. Moreover, due to the complexity of genomic contexts, generating an interpretable model also remains challenged. To address these issues, firstly we developed a BERT-based model called CRISPR-BERT for enhancing the prediction of off-target activities with both mismatches and indels. Secondly, we proposed an adaptive batch-wise class balancing strategy to combat the noise exists in imbalanced off-target data. Finally, we applied a visualization approach for investigating the generalizable nucleotide position-dependent patterns of sgRNA-DNA pair for off-target activity. In our comprehensive comparison to existing methods on five mismatches-only datasets and two mismatches-and-indels datasets, CRISPR-BERT achieved the best performance in terms of AUROC and PRAUC. Besides, the visualization analysis demonstrated how implicit knowledge learned by CRISPR-BERT facilitates off-target prediction, which shows potential in model interpretability. Collectively, CRISPR-BERT provides an accurate and interpretable framework for off-target prediction, further contributes to sgRNA optimization in practical use for improved target specificity in CRISPR/Cas9 genome editing. The source code is available at https://github.com/BrokenStringx/CRISPR-BERT
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Current advances in overcoming obstacles of CRISPR/Cas9 off-target genome editing
    Aquino-Jarquin, Guillermo
    MOLECULAR GENETICS AND METABOLISM, 2021, 134 (1-2) : 77 - 86
  • [22] Synergizing CRISPR/Cas9 off-target predictions for ensemble insights and practical applications
    Zhang, Shixiong
    Li, Xiangtao
    Lin, Qiuzhen
    Wong, Ka-Chun
    BIOINFORMATICS, 2019, 35 (07) : 1108 - 1115
  • [23] Efficient CRISPR/Cas9 genome editing with low off-target effects in zebrafish
    Hruscha, Alexander
    Krawitz, Peter
    Rechenberg, Alexandra
    Heinrich, Verena
    Hecht, Jochen
    Haass, Christian
    Schmid, Bettina
    DEVELOPMENT, 2013, 140 (24): : 4982 - 4987
  • [24] Multigene Knockout Utilizing Off-Target Mutations of the CRISPR/Cas9 System in Rice
    Endo, Masaki
    Mikami, Masafumi
    Toki, Seiichi
    PLANT AND CELL PHYSIOLOGY, 2015, 56 (01) : 41 - 47
  • [25] Biased and Unbiased Methods for the Detection of Off-Target Cleavage by CRISPR/Cas9: An Overview
    Martin, Francisco
    Sanchez-Hernandez, Sabina
    Gutierrez-Guerrero, Alejandra
    Pinedo-Gomez, Javier
    Benabdellah, Karim
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (09):
  • [26] Disrupting off-target Cas9 activity in the liver
    Dilliard, Sean A.
    Siegwart, Daniel J.
    NATURE BIOMEDICAL ENGINEERING, 2022, 6 (02) : 106 - 107
  • [27] Structural basis for Cas9 off-target activity
    Pacesa, Martin
    Lin, Chun-Han
    Clery, Antoine
    Saha, Aakash
    Arantes, Pablo R.
    Bargsten, Katja
    Irby, Matthew J.
    Allain, Frederic H. -T.
    Palermo, Giulia
    Cameron, Peter
    Donohoue, Paul D.
    Jinek, Martin
    CELL, 2022, 185 (22) : 4067 - +
  • [28] Structural basis for Cas9 off-target activity
    Pacesa, Martin
    Lin, Chun-Han
    Clery, Antoine
    Saha, Aakash
    Arantes, Pablo R.
    Bargsten, Katja
    Irby, Matthew J.
    Allain, Frederic H-T
    Palermo, Giulia
    Cameron, Peter
    Donohoue, Paul D.
    Jinek, Martin
    CELL, 2022, 185 (23) : 4067 - +
  • [29] Disrupting off-target Cas9 activity in the liver
    Sean A. Dilliard
    Daniel J. Siegwart
    Nature Biomedical Engineering, 2022, 6 : 106 - 107
  • [30] Strategies to Increase On-Target and Reduce Off-Target Effects of the CRISPR/Cas9 System in Plants
    Hajiahmadi, Zahra
    Movahedi, Ali
    Wei, Hui
    Li, Dawei
    Orooji, Yasin
    Ruan, Honghua
    Zhuge, Qiang
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (15)