Interpretable CRISPR/Cas9 off-target activities with mismatches and indels prediction using BERT

被引:2
|
作者
Luo, Ye [1 ]
Chen, Yaowen [1 ]
Xie, HuanZeng [1 ]
Zhu, Wentao [1 ]
Zhang, Guishan [1 ]
机构
[1] Shantou Univ, Coll Engn, Shantou 515063, Peoples R China
基金
中国国家自然科学基金;
关键词
CRISPER/Cas9; Off-target; BERT; Adaptive batch-wise olass balancing; Deep learning; GENOME EDITING TECHNOLOGIES; CLASSIFICATION; CRISPR-CAS9; SPECIFICITY; DESIGN; CAS9; SYSTEMS; DNA;
D O I
10.1016/j.compbiomed.2024.107932
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Off-target effects of CRISPR/Cas9 can lead to suboptimal genome editing outcomes. Numerous deep learning-based approaches have achieved excellent performance for off-target prediction; however, few can predict the off-target activities with both mismatches and indels between single guide RNA (sgRNA) and target DNA sequence pair. In addition, data imbalance is a common pitfall for off-target prediction. Moreover, due to the complexity of genomic contexts, generating an interpretable model also remains challenged. To address these issues, firstly we developed a BERT-based model called CRISPR-BERT for enhancing the prediction of off-target activities with both mismatches and indels. Secondly, we proposed an adaptive batch-wise class balancing strategy to combat the noise exists in imbalanced off-target data. Finally, we applied a visualization approach for investigating the generalizable nucleotide position-dependent patterns of sgRNA-DNA pair for off-target activity. In our comprehensive comparison to existing methods on five mismatches-only datasets and two mismatches-and-indels datasets, CRISPR-BERT achieved the best performance in terms of AUROC and PRAUC. Besides, the visualization analysis demonstrated how implicit knowledge learned by CRISPR-BERT facilitates off-target prediction, which shows potential in model interpretability. Collectively, CRISPR-BERT provides an accurate and interpretable framework for off-target prediction, further contributes to sgRNA optimization in practical use for improved target specificity in CRISPR/Cas9 genome editing. The source code is available at https://github.com/BrokenStringx/CRISPR-BERT
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Measuring and Reducing Off-Target Activities of Programmable Nucleases Including CRISPR-Cas9
    Koo, Taeyoung
    Lee, Jungjoon
    Kim, Jin-Soo
    MOLECULES AND CELLS, 2015, 38 (06) : 475 - 481
  • [42] CRISPR GUARD protects off-target sites from Cas9 nuclease activity using short guide RNAs
    Matthew A. Coelho
    Etienne De Braekeleer
    Mike Firth
    Michal Bista
    Sebastian Lukasiak
    Maria Emanuela Cuomo
    Benjamin J. M. Taylor
    Nature Communications, 11
  • [43] Efficient assessment of CRISPR/Cas9 off-target activity using a background double-strand break model
    Lescarbeau, R. M.
    Murray, B. A.
    Seitzer, J.
    Growe, J.
    Huang, H.
    Dombrowski, C.
    Morrissey, D. V.
    Barnes, T.
    HUMAN GENE THERAPY, 2016, 27 (11) : A131 - A132
  • [44] Searching for Potential gRNA Off-Target Sites for CRISPR/Cas9 using Automata Processing across Different Platforms
    Bo, Chunkun
    Dang, Vinh
    Sadredini, Elaheh
    Skadron, Kevin
    2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2018, : 737 - 748
  • [45] CRISPR/Cas9 can mediate high-efficiency off-target mutations in mice in vivo
    Aryal, Neeraj K.
    Wasylishen, Amanda R.
    Lozano, Guillermina
    CELL DEATH & DISEASE, 2018, 9
  • [46] CRISPR GUARD protects off-target sites from Cas9 nuclease activity using short guide RNAs
    Coelho, Matthew A.
    De Braekeleer, Etienne
    Firth, Mike
    Bista, Michal
    Lukasiak, Sebastian
    Cuomo, Maria Emanuela
    Taylor, Benjamin J. M.
    NATURE COMMUNICATIONS, 2020, 11 (01)
  • [47] Recognition of CRISPR/Cas9 off-target sites through ensemble learning of uneven mismatch distributions
    Peng, Hui
    Zheng, Yi
    Zhao, Zhixun
    Liu, Tao
    Li, Jinyan
    BIOINFORMATICS, 2018, 34 (17) : 757 - 765
  • [48] CRISPR/Cas9 can mediate high-efficiency off-target mutations in mice in vivo
    Neeraj K Aryal
    Amanda R Wasylishen
    Guillermina Lozano
    Cell Death & Disease, 9
  • [49] Avoiding the off-target effects of CRISPR/cas9 system is still a challenging accomplishment for genetic transformation
    Herai, Roberto H.
    GENE, 2019, 700 : 176 - 178
  • [50] Validation of an In Vitro CRISPR-Cas9 Off-Target Prediction Method in Rhesus Macaques
    AlJanahi, Aisha A.
    Lazzarotto, Cicera
    Yu, Kyung-Rok
    Hong, So Gun
    Chen, Shirley
    Donahue, Robert
    Li, Yuesheng
    Shin, Taehoon
    Tsai, Shengdar
    Dunbar, Cynthia
    MOLECULAR THERAPY, 2018, 26 (05) : 85 - 86