CRISPR-M: Predicting sgRNA off-target effect using a multi-view deep learning network

被引:2
|
作者
Sun, Jialiang [1 ]
Guo, Jun [2 ]
Liu, Jian [1 ,3 ]
机构
[1] Nankai Univ, Coll Comp Sci, Tianjin, Peoples R China
[2] Northeastern Univ, Coll Software, Shenyang, Peoples R China
[3] Nankai Univ, Ctr Bioinformat & Intelligent Med, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
RNA; CAS9; DESIGN; SEQ; EFFICIENCY; NUCLEASES; CLEAVAGE; SYSTEM;
D O I
10.1371/journal.pcbi.1011972
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Using the CRISPR-Cas9 system to perform base substitutions at the target site is a typical technique for genome editing with the potential for applications in gene therapy and agricultural productivity. When the CRISPR-Cas9 system uses guide RNA to direct the Cas9 endonuclease to the target site, it may misdirect it to a potential off-target site, resulting in an unintended genome editing. Although several computational methods have been proposed to predict off-target effects, there is still room for improvement in the off-target effect prediction capability. In this paper, we present an effective approach called CRISPR-M with a new encoding scheme and a novel multi-view deep learning model to predict the sgRNA off-target effects for target sites containing indels and mismatches. CRISPR-M takes advantage of convolutional neural networks and bidirectional long short-term memory recurrent neural networks to construct a three-branch network towards multi-views. Compared with existing methods, CRISPR-M demonstrates significant performance advantages running on real-world datasets. Furthermore, experimental analysis of CRISPR-M under multiple metrics reveals its capability to extract features and validates its superiority on sgRNA off-target effect predictions. Genome editing using the CRISPR-Cas9 system, particularly base substitutions directed by guide RNA, holds immense potential for applications in gene therapy and agricultural productivity. However, the risk of unintended off-target effects poses a challenge, as misdirection of the Cas9 endonuclease can lead to unintended genome alterations. While computational methods exist for predicting off-target effects, there remains a need for encoding methods with more representation space and deep learning models with generalization capability and the adaptability. This paper introduces CRISPR-M, an innovative approach addressing the limitations of existing methods in predicting off-target effects, especially for target sites with indels and mismatches. CRISPR-M employs a novel encoding scheme and a multi-view deep learning model, combining convolutional neural networks and bidirectional long short-term memory recurrent neural networks. The three-branch network structure enhances the prediction accuracy by considering multiple perspectives. Compared with previous representative methods, CRISPR-M exhibits remarkable performance advantages when applied to real-world datasets. The experimental evaluation of CRISPR-M, assessed by various metrics such as ROC, PRC, GC content and melting temperature, demonstrates its ability to extract meaningful features and establishes its superiority in predicting off-target effects of sgRNA.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Benchmarking deep learning methods for predicting CRISPR/Cas9 sgRNA on- and off-target activities
    Zhang, Guishan
    Luo, Ye
    Dai, Xianhua
    Dai, Zhiming
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (06)
  • [2] Deep learning improves the ability of sgRNA off-target propensity prediction
    Qiaoyue Liu
    Xiang Cheng
    Gan Liu
    Bohao Li
    Xiuqin Liu
    BMC Bioinformatics, 21
  • [3] Deep learning improves the ability of sgRNA off-target propensity prediction
    Liu, Qiaoyue
    Cheng, Xiang
    Liu, Gan
    Li, Bohao
    Liu, Xiuqin
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [4] Prediction of sgRNA Off-Target Activity in CRISPR/Cas9 Gene Editing Using Graph Convolution Network
    Vinodkumar, Prasoon Kumar
    Ozcinar, Cagri
    Anbarjafari, Gholamreza
    ENTROPY, 2021, 23 (05)
  • [5] Off-target predictions in CRISPR-Cas9 gene editing using deep learning
    Lin, Jiecong
    Wong, Ka-Chun
    BIOINFORMATICS, 2018, 34 (17) : 656 - 663
  • [6] Accurate deep learning off-target prediction with novel sgRNA-DNA sequence encoding in CRISPR-Cas9 gene editing
    Charlier, Jeremy
    Nadon, Robert
    Makarenkov, Vladimir
    BIOINFORMATICS, 2021, 37 (16) : 2299 - 2307
  • [7] Deep cross-view autoencoder network for multi-view learning
    Mi, Jian-Xun
    Fu, Chang-Qing
    Chen, Tao
    Gou, Tingting
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24645 - 24664
  • [8] Deep cross-view autoencoder network for multi-view learning
    Jian-Xun Mi
    Chang-Qing Fu
    Tao Chen
    Tingting Gou
    Multimedia Tools and Applications, 2022, 81 : 24645 - 24664
  • [9] PDGNet: Predicting Disease Genes Using a Deep Neural Network With Multi-View Features
    Yang, Kuo
    Zheng, Yi
    Lu, Kezhi
    Chang, Kai
    Wang, Ning
    Shu, Zixin
    Yu, Jian
    Liu, Baoyan
    Gao, Zhuye
    Zhou, Xuezhong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 575 - 584
  • [10] CRISPR-DIPOFF: an interpretable deep learning approach for CRISPR Cas-9 off-target prediction
    Toufikuzzaman, Md
    Samee, Md Abul Hassan
    Rahman, M. Sohel
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)