Benchmarking deep learning methods for predicting CRISPR/Cas9 sgRNA on- and off-target activities

被引:10
|
作者
Zhang, Guishan [3 ]
Luo, Ye [3 ]
Dai, Xianhua [1 ,4 ]
Dai, Zhiming [2 ,5 ]
机构
[1] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen 518107, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
[3] Shantou Univ, Coll Engn, Shantou, Peoples R China
[4] Sun Yat Sen Univ, Sch Cyber Sci & Technol, Shenzhen, Peoples R China
[5] Sun Yat sen Univ, Sch Comp Sci & Engn, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
CRISPR/Cas9; sgRNA; deep learning; on-target; off-target; GUIDE RNA DESIGN; CHROMATIN ACCESSIBILITY; CLEAVAGE EFFICIENCY; SEQUENCE FEATURES; PAM COMPATIBILITY; NEURAL-NETWORKS; GENOME; CRISPR-CAS9; CAS9; SPECIFICITY;
D O I
10.1093/bib/bbad333
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In silico design of single guide RNA (sgRNA) plays a critical role in clustered regularly interspaced, short palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9) system. Continuous efforts are aimed at improving sgRNA design with efficient on-target activity and reduced off-target mutations. In the last 5 years, an increasing number of deep learning-based methods have achieved breakthrough performance in predicting sgRNA on- and off-target activities. Nevertheless, it is worthwhile to systematically evaluate these methods for their predictive abilities. In this review, we conducted a systematic survey on the progress in prediction of on- and off-target editing. We investigated the performances of 10 mainstream deep learning-based on-target predictors using nine public datasets with different sample sizes. We found that in most scenarios, these methods showed superior predictive power on large- and medium-scale datasets than on small-scale datasets. In addition, we performed unbiased experiments to provide in-depth comparison of eight representative approaches for off-target prediction on 12 publicly available datasets with various imbalanced ratios of positive/negative samples. Most methods showed excellent performance on balanced datasets but have much room for improvement on moderate- and severe-imbalanced datasets. This study provides comprehensive perspectives on CRISPR/Cas9 sgRNA on- and off-target activity prediction and improvement for method development.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] A comprehensive study comparing on- and off-target levels of the most common forms of CRISPR/Cas9 guide RNAs
    Jacobi, A.
    Rettig, G.
    Schubert, M.
    Shapiro, J.
    Iancu, O.
    Tovin, A.
    Cedrone, E.
    Dobrovolskaia, M.
    Hendel, A.
    Behlke, M.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 788 - 789
  • [22] CRISPR/CAS9 Target Prediction with Deep Learning
    Aktas, Ozlem
    Dogan, Elif
    Ensari, Tolga
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [23] Crispr-SGRU: Prediction of CRISPR/Cas9 Off-Target Activities with Mismatches and Indels Using Stacked BiGRU
    Zhang, Guishan
    Luo, Ye
    Xie, Huanzeng
    Dai, Zhiming
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (20)
  • [24] Accurate deep learning off-target prediction with novel sgRNA-DNA sequence encoding in CRISPR-Cas9 gene editing
    Charlier, Jeremy
    Nadon, Robert
    Makarenkov, Vladimir
    BIOINFORMATICS, 2021, 37 (16) : 2299 - 2307
  • [25] Off-target predictions in CRISPR-Cas9 gene editing using deep learning
    Lin, Jiecong
    Wong, Ka-Chun
    BIOINFORMATICS, 2018, 34 (17) : 656 - 663
  • [26] CRISPR-DIPOFF: an interpretable deep learning approach for CRISPR Cas-9 off-target prediction
    Toufikuzzaman, Md
    Samee, Md Abul Hassan
    Rahman, M. Sohel
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [27] Recognition of CRISPR/Cas9 off-target sites through ensemble learning of uneven mismatch distributions
    Peng, Hui
    Zheng, Yi
    Zhao, Zhixun
    Liu, Tao
    Li, Jinyan
    BIOINFORMATICS, 2018, 34 (17) : 757 - 765
  • [28] Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9
    Doench, John G.
    Fusi, Nicolo
    Sullender, Meagan
    Hegde, Mudra
    Vaimberg, Emma W.
    Donovan, Katherine F.
    Smith, Ian
    Tothova, Zuzana
    Wilen, Craig
    Orchard, Robert
    Virgin, Herbert W.
    Listgarten, Jennifer
    Root, David E.
    NATURE BIOTECHNOLOGY, 2016, 34 (02) : 184 - +
  • [29] Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9
    John G Doench
    Nicolo Fusi
    Meagan Sullender
    Mudra Hegde
    Emma W Vaimberg
    Katherine F Donovan
    Ian Smith
    Zuzana Tothova
    Craig Wilen
    Robert Orchard
    Herbert W Virgin
    Jennifer Listgarten
    David E Root
    Nature Biotechnology, 2016, 34 : 184 - 191
  • [30] CRISPR-M: Predicting sgRNA off-target effect using a multi-view deep learning network
    Sun, Jialiang
    Guo, Jun
    Liu, Jian
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (03)