CRISPR-M: Predicting sgRNA off-target effect using a multi-view deep learning network

被引:2
|
作者
Sun, Jialiang [1 ]
Guo, Jun [2 ]
Liu, Jian [1 ,3 ]
机构
[1] Nankai Univ, Coll Comp Sci, Tianjin, Peoples R China
[2] Northeastern Univ, Coll Software, Shenyang, Peoples R China
[3] Nankai Univ, Ctr Bioinformat & Intelligent Med, Tianjin, Peoples R China
基金
中国国家自然科学基金;
关键词
RNA; CAS9; DESIGN; SEQ; EFFICIENCY; NUCLEASES; CLEAVAGE; SYSTEM;
D O I
10.1371/journal.pcbi.1011972
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Using the CRISPR-Cas9 system to perform base substitutions at the target site is a typical technique for genome editing with the potential for applications in gene therapy and agricultural productivity. When the CRISPR-Cas9 system uses guide RNA to direct the Cas9 endonuclease to the target site, it may misdirect it to a potential off-target site, resulting in an unintended genome editing. Although several computational methods have been proposed to predict off-target effects, there is still room for improvement in the off-target effect prediction capability. In this paper, we present an effective approach called CRISPR-M with a new encoding scheme and a novel multi-view deep learning model to predict the sgRNA off-target effects for target sites containing indels and mismatches. CRISPR-M takes advantage of convolutional neural networks and bidirectional long short-term memory recurrent neural networks to construct a three-branch network towards multi-views. Compared with existing methods, CRISPR-M demonstrates significant performance advantages running on real-world datasets. Furthermore, experimental analysis of CRISPR-M under multiple metrics reveals its capability to extract features and validates its superiority on sgRNA off-target effect predictions. Genome editing using the CRISPR-Cas9 system, particularly base substitutions directed by guide RNA, holds immense potential for applications in gene therapy and agricultural productivity. However, the risk of unintended off-target effects poses a challenge, as misdirection of the Cas9 endonuclease can lead to unintended genome alterations. While computational methods exist for predicting off-target effects, there remains a need for encoding methods with more representation space and deep learning models with generalization capability and the adaptability. This paper introduces CRISPR-M, an innovative approach addressing the limitations of existing methods in predicting off-target effects, especially for target sites with indels and mismatches. CRISPR-M employs a novel encoding scheme and a multi-view deep learning model, combining convolutional neural networks and bidirectional long short-term memory recurrent neural networks. The three-branch network structure enhances the prediction accuracy by considering multiple perspectives. Compared with previous representative methods, CRISPR-M exhibits remarkable performance advantages when applied to real-world datasets. The experimental evaluation of CRISPR-M, assessed by various metrics such as ROC, PRC, GC content and melting temperature, demonstrates its ability to extract meaningful features and establishes its superiority in predicting off-target effects of sgRNA.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Modeling CRISPR-Cas13d on-target and off-target effects using machine learning approaches
    Xiaolong Cheng
    Zexu Li
    Ruocheng Shan
    Zihan Li
    Shengnan Wang
    Wenchang Zhao
    Han Zhang
    Lumen Chao
    Jian Peng
    Teng Fei
    Wei Li
    Nature Communications, 14
  • [32] MultiChem: predicting chemical properties using multi-view graph attention network
    Moon, Heesang
    Rho, Mina
    BIODATA MINING, 2025, 18 (01):
  • [33] Prognosticating Colorectal Cancer Recurrence using an Interpretable Deep Multi-view Network
    Ho, Danliang
    Tan, Iain Bee Huat
    Motani, Mehul
    MACHINE LEARNING FOR HEALTH, VOL 158, 2021, 158 : 97 - 109
  • [34] MultiSpectralNet: Spectral Clustering Using Deep Neural Network for Multi-View Data
    Huang, Shutting
    Ota, Kaoru
    Dong, Mianxiong
    Li, Fanzhang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2019, 6 (04) : 749 - 760
  • [35] AntiMF: A deep learning framework for predicting anticancer peptides based on multi-view feature extraction
    Liu, Jingjing
    Li, Minghao
    Chen, Xin
    METHODS, 2022, 207 : 38 - 43
  • [36] Multi-view Cooperative Deep Convolutional Network for Facial Recognition with Small Samples Learning
    Alfakih, Amani
    Yang, Shuyuan
    Hu, Tao
    DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 16TH INTERNATIONAL CONFERENCE, 2020, 1003 : 207 - 216
  • [37] Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature
    Liu, Qiao
    He, Di
    Xie, Lei
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (10)
  • [38] Predicting drug-drug interactions based on multi-view and multichannel attention deep learning
    Huang, Liyu
    Chen, Qingfeng
    Lan, Wei
    HEALTH INFORMATION SCIENCE AND SYSTEMS, 2023, 11 (01)
  • [39] Efficient intelligent quality detection of pistachios using multi-view deep learning
    Zhu, Hongfei
    Fu, Huayu
    Wang, Cong
    Hua, Zhenlu
    Liu, Xingyu
    Shi, Weiming
    Zong, Ziyan
    Zhao, Yanshen
    Liu, Fei
    Deng, Limiao
    Yang, Ranbing
    Han, Zhongzhi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 143
  • [40] Improving deep learning based segmentation of scars using multi-view images
    Zhou, Jian
    Dai, Yuqing
    Liu, Dongmei
    Zhu, Weifang
    Xiang, Dehui
    Chen, Xinjian
    Shi, Fei
    Xia, Wentao
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 94