CRISPR-OTE: Prediction of CRISPR On-Target Efficiency Based on Multi-Dimensional Feature Fusion

被引:1
|
作者
Xie, J. [1 ]
Liu, M. [1 ]
Zhou, L. [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, China Hosp Dev Inst, Ctr Med Intelligent & Dev, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Genome editing; CRISPR; On-target efficiency; Deep learning; Prior knowledge; GUIDE-RNA; DESIGN; SINGLE; ENDONUCLEASE; SGRNAS; MODEL; CPF1;
D O I
10.1016/j.irbm.2022.07.003
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a powerful genome editing technology. Guide RNA (gRNA) plays an essential guiding role in the CRISPR system by complementary base pairing with target DNA. Since the CRISPR targeting mechanism problem has not yet been fully resolved, it remains a challenge to predict gRNA on-target efficiency. Current gRNA design tools often lack efficient information extraction and cannot learn the target efficiency patterns thoroughly.Material and methods: In this study, CRISPR-OTE is proposed to consider both multi-dimensional sequence information and important complementary prior knowledge based on a simple but effective framework. CRISPR-OTE consists of the local-contextual information branch and the prior knowledge branch. The local-contextual information branch extracts multi-dimensional sequence features from the DNA primary sequence by a parallel framework of Convolutional Neural Networks (CNN) and bidirectional Long Short-Term Memory networks (biLSTM). The prior knowledge branch selects the optimal subset of physicochemical features to provide the neural network with complementary knowledge, such as complex secondary structures. A simple feature fusion strategy is also adopted to fully utilize multi-modal data from the two branches.Results: The experimental results show that the optimal subset of physicochemical features (RNA secondary structure and melting temperature of 34nt target) can effectively improve the prediction performance. Additionally, combining multi-dimensional sequence features and multi-modal features can extract information more comprehensively. Through transfer learning, CRISPR-OTE trained on the CRISPR-Cpf1 system can also be successfully applied to the CRISPR-Cas9 system.Conclusion: The performance of CRISPR-OTE is superior to other methods in different CRISPR systems and species. Therefore, CRISPR-OTE is a simple on-target efficiency prediction framework with better accuracy and generalization performance.(c) 2022 AGBM. Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Time Series Classification Based on Multi-Dimensional Feature Fusion
    Quan, Shuo
    Sun, Mengyu
    Zeng, Xiangyu
    Wang, Xuliang
    Zhu, Zeya
    [J]. IEEE ACCESS, 2023, 11 : 11066 - 11077
  • [2] DNA Steganalysis Based on Multi-dimensional Feature Extraction and Fusion
    Wang, Zhuang
    Xia, Jinyi
    Huang, Kaibo
    Guo, Shengnan
    Huang, Chenwei
    Yang, Zhongliang
    Zhou, Linna
    [J]. DIGITAL FORENSICS AND WATERMARKING, IWDW 2023, 2024, 14511 : 277 - 291
  • [3] Multi-dimensional feature fusion based on narrowband RCS for UAV
    Xuejian, Feng
    Chaoying, Huo
    Chenxi, Zhu
    Haochuan, Deng
    Hongcheng, Yin
    [J]. 2022 CROSS STRAIT RADIO SCIENCE & WIRELESS TECHNOLOGY CONFERENCE, CSRSWTC, 2022,
  • [4] Noodles, the all-in-one system for on-target efficiency analysis of CRISPR guide RNAs
    Lin, Dongfa
    Najam, Syeda Sadia
    Liu, Yu
    Murgia, Nicola
    Vinnikov, Ilya A.
    [J]. METHODSX, 2024, 12
  • [5] CRISPRLearner: A Deep Learning-Based System to Predict CRISPR/Cas9 sgRNA On-Target Cleavage Efficiency
    Dimauro, Giovanni
    Colagrande, Pierpasquale
    Carlucci, Roberto
    Ventura, Mario
    Bevilacqua, Vitoantonio
    Caivano, Danilo
    [J]. ELECTRONICS, 2019, 8 (12)
  • [6] Speech emotion recognition based on multi-dimensional feature extraction and multi-scale feature fusion
    Yu, Lingli
    Xu, Fengjun
    Qu, Yundong
    Zhou, Kaijun
    [J]. APPLIED ACOUSTICS, 2024, 216
  • [7] CRISPR-Cas9-based mutagenesis frequently provokes on-target mRNA misregulation
    Tuladhar, Rubina
    Yeu, Yunku
    Piazza, John Tyler
    Tan, Zhen
    Clemenceau, Jean Rene
    Wu, Xiaofeng
    Barrett, Quinn
    Herbert, Jeremiah
    Mathews, David H.
    Kim, James
    Hwang, Tae Hyun
    Lum, Lawrence
    [J]. NATURE COMMUNICATIONS, 2019, 10 (1)
  • [8] CRISPR-Cas9-based mutagenesis frequently provokes on-target mRNA misregulation
    Rubina Tuladhar
    Yunku Yeu
    John Tyler Piazza
    Zhen Tan
    Jean Rene Clemenceau
    Xiaofeng Wu
    Quinn Barrett
    Jeremiah Herbert
    David H. Mathews
    James Kim
    Tae Hyun Hwang
    Lawrence Lum
    [J]. Nature Communications, 10
  • [9] Novel Product Duration Prediction Method of Complicated Product Based on Multi-Dimensional Nonlinear Feature Reconstruction and Fusion
    Chang, Jiantao
    Qiao, Zixuan
    Kong, Xianguang
    Yang, Shengkang
    Luo, Caiwen
    [J]. Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2023, 59 (06): : 294 - 308
  • [10] Recognition of coal and gangue based on multi-dimensional gray gradient feature fusion
    Luo, Qisheng
    Wang, Shuang
    Li, Xin
    He, Lei
    [J]. ENERGY SOURCES PART A-RECOVERY UTILIZATION AND ENVIRONMENTAL EFFECTS, 2022, 44 (03) : 8060 - 8076