CRISPR-OTE: Prediction of CRISPR On-Target Efficiency Based on Multi-Dimensional Feature Fusion

被引:1
|
作者
Xie, J. [1 ]
Liu, M. [1 ]
Zhou, L. [2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Shanghai Jiao Tong Univ, China Hosp Dev Inst, Ctr Med Intelligent & Dev, Shanghai, Peoples R China
基金
中国国家自然科学基金;
关键词
Genome editing; CRISPR; On-target efficiency; Deep learning; Prior knowledge; GUIDE-RNA; DESIGN; SINGLE; ENDONUCLEASE; SGRNAS; MODEL; CPF1;
D O I
10.1016/j.irbm.2022.07.003
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective: Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a powerful genome editing technology. Guide RNA (gRNA) plays an essential guiding role in the CRISPR system by complementary base pairing with target DNA. Since the CRISPR targeting mechanism problem has not yet been fully resolved, it remains a challenge to predict gRNA on-target efficiency. Current gRNA design tools often lack efficient information extraction and cannot learn the target efficiency patterns thoroughly.Material and methods: In this study, CRISPR-OTE is proposed to consider both multi-dimensional sequence information and important complementary prior knowledge based on a simple but effective framework. CRISPR-OTE consists of the local-contextual information branch and the prior knowledge branch. The local-contextual information branch extracts multi-dimensional sequence features from the DNA primary sequence by a parallel framework of Convolutional Neural Networks (CNN) and bidirectional Long Short-Term Memory networks (biLSTM). The prior knowledge branch selects the optimal subset of physicochemical features to provide the neural network with complementary knowledge, such as complex secondary structures. A simple feature fusion strategy is also adopted to fully utilize multi-modal data from the two branches.Results: The experimental results show that the optimal subset of physicochemical features (RNA secondary structure and melting temperature of 34nt target) can effectively improve the prediction performance. Additionally, combining multi-dimensional sequence features and multi-modal features can extract information more comprehensively. Through transfer learning, CRISPR-OTE trained on the CRISPR-Cpf1 system can also be successfully applied to the CRISPR-Cas9 system.Conclusion: The performance of CRISPR-OTE is superior to other methods in different CRISPR systems and species. Therefore, CRISPR-OTE is a simple on-target efficiency prediction framework with better accuracy and generalization performance.(c) 2022 AGBM. Published by Elsevier Masson SAS. All rights reserved.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Stress Classification Using ECGs Based on a Multi-Dimensional Feature Fusion of LSTM and Xception
    Song, Cheol Ho
    Kim, Jin Su
    Kim, Jae Myung
    Pan, Sungbum
    [J]. IEEE ACCESS, 2024, 12 : 19077 - 19086
  • [22] Stress Classification Using ECGs Based on a Multi-Dimensional Feature Fusion of LSTM and Xception
    Song, Cheol Ho
    Kim, Jin Su
    Kim, Jae Myung
    Pan, Sungbum
    [J]. IEEE Access, 2024, 12 : 19077 - 19086
  • [23] Recognising drivers? mental fatigue based on EEG multi-dimensional feature selection and fusion
    Zhang, Yuhao
    Guo, Hanying
    Zhou, Yongjiang
    Xu, Chengji
    Liao, Yang
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 79
  • [24] Fall detection system based on infrared array sensor and multi-dimensional feature fusion
    Yang, Yi
    Yang, Honglei
    Liu, Zhixin
    Yuan, Yazhou
    Guan, Xinping
    [J]. MEASUREMENT, 2022, 192
  • [25] Eye blink artifact detection based on multi-dimensional EEG feature fusion and optimization
    Wang, Meng
    Cui, Xiaonan
    Wang, Tianlei
    Jiang, Tiejia
    Gao, Feng
    Cao, Jiuwen
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, 83
  • [26] Multi-dimensional feature fusion-based expert recommendation in community question answering
    Ye, Guanghui
    Li, Songye
    Wu, Lanqi
    Wei, Jinyu
    Wu, Chuan
    Wang, Yujie
    Li, Jiarong
    Liang, Bo
    Liu, Shuyan
    [J]. ELECTRONIC LIBRARY, 2024,
  • [27] Text matching model based on dense connection networkand multi-dimensional feature fusion
    Chen Y.-L.
    Tian W.-J.
    Cai X.-D.
    Zheng S.-T.
    [J]. Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2021, 55 (12): : 2352 - 2358
  • [28] CLSTM-AR-Based Multi-Dimensional Feature Fusion for Multi-Energy Load Forecasting
    Ren, Bowen
    Huang, Cunqiang
    Chen, Laijun
    Mei, Shengwei
    An, Juan
    Liu, Xingwen
    Ma, Hengrui
    [J]. ELECTRONICS, 2022, 11 (21)
  • [29] In vivo multi-dimensional CRISPR screens identify LGALS2 as an immunotherapy target in triple-negative breast cancer
    Ji, P.
    Gong, Y.
    Jin, M.
    Hu, X.
    Di, G.
    Shao, Z.
    [J]. ANNALS OF ONCOLOGY, 2022, 33 : S133 - S133
  • [30] DeepCRISTL: deep transfer learning to predict CRISPR/Cas9 on-target editing efficiency in specific cellular contexts
    Elkayam, Shai
    Tziony, Ido
    Orenstein, Yaron
    [J]. BIOINFORMATICS, 2024, 40 (08)