Predicting the effect of variants on splicing using Convolutional Neural Networks

被引:3
|
作者
Thanapattheerakul, Thanyathorn [1 ]
Engchuan, Worrawat [2 ,3 ]
Chan, Jonathan H. [1 ,4 ]
机构
[1] King Mongkuts Univ Technol Thonburi, Sch Informat Technol, Bangkok, Thailand
[2] Hosp Sick Children, Genet & Genome Biol, Toronto, ON, Canada
[3] Hosp Sick Children, Ctr Appl Genom, Toronto, ON, Canada
[4] King Mongkuts Univ Technol Thonburi, Sch Informat Technol, IC2 DLab, Bangkok, Thailand
来源
PEERJ | 2020年 / 8卷
关键词
Splice site; Convolutional neural networks; Deep learning; RNA Splice Sites; Binding sites; Splicing events; Genomic variants;
D O I
10.7717/peerj.9470
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Mutations that cause an error in the splicing of a messenger RNA (mRNA) can lead to diseases in humans. Various computational models have been developed to recognize the sequence pattern of the splice sites. In recent studies, Convolutional Neural Network (CNN) architectures were shown to outperform other existing models in predicting the splice sites. However, an insufficient effort has been put into extending the CNN model to predict the effect of the genomic variants on the splicing of mRNAs. This study proposes a framework to elaborate on the utility of CNNs to assess the effect of splice variants on the identification of potential disease-causing variants that disrupt the RNA splicing process. Five models, including three CNN-based and two non-CNN machine learning based, were trained and compared using two existing splice site datasets, Genome Wide Human splice sites (GWH) and a dataset provided at the Deep Learning and Artificial Intelligence winter school 2018 (DLAI). The donor sites were also used to test on the HSplice tool to evaluate the predictive models. To improve the effectiveness of predictive models, two datasets were combined. The CNN model with four convolutional layers showed the best splice site prediction performance with an AUPRC of 93.4% and 88.8% for donor and acceptor sites, respectively. The effects of variants on splicing were estimated by applying the best model on variant data from the ClinVar database. Based on the estimation, the framework could effectively differentiate pathogenic variants from the benign variants (p = 5.9x10(-7) These promising results support that the proposed framework could be applied in future genetic studies to identify disease causing loci involving the splicing mechanism. The datasets and Python scripts used in this study are available on the GitHub repository at https://github.com/smiile8888/rna-splice-sites-recognition.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Predicting Sleeping Quality Using Convolutional Neural Networks
    Sathish, Vidya Rohini Konanur
    Woo, Wai Lok
    Ho, Edmond S. L.
    [J]. ADVANCES IN CYBERSECURITY, CYBERCRIMES, AND SMART EMERGING TECHNOLOGIES, 2023, 4 : 175 - 184
  • [2] Predicting Magnetization Directions Using Convolutional Neural Networks
    Nurindrawati, Felicia
    Sun, Jiajia
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2020, 125 (10)
  • [3] Predicting mortgage default using convolutional neural networks
    Kvamme, Havard
    Sellereite, Nikolai
    Aas, Kjersti
    Sjursen, Steffen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2018, 102 : 207 - 217
  • [4] PREDICTING UNIVERSITY DROPOUT BY USING CONVOLUTIONAL NEURAL NETWORKS
    Mezzini, Mauro
    Bonavolonta, Gianmarco
    Agrusti, Francesco
    [J]. 13TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2019), 2019, : 9155 - 9163
  • [5] On Predicting Geolocation of Tweets Using Convolutional Neural Networks
    Huang, Binxuan
    Carley, Kathleen M.
    [J]. SOCIAL, CULTURAL, AND BEHAVIORAL MODELING, 2017, 10354 : 281 - 291
  • [6] Predicting Eye Fixations using Convolutional Neural Networks
    Liu, Nian
    Han, Junwei
    Zhang, Dingwen
    Wen, Shifeng
    Liu, Tianming
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 362 - 370
  • [7] Predicting functional effect of missense variants using graph attention neural networks
    Haicang Zhang
    Michelle S. Xu
    Xiao Fan
    Wendy K. Chung
    Yufeng Shen
    [J]. Nature Machine Intelligence, 2022, 4 : 1017 - 1028
  • [8] Predicting functional effect of missense variants using graph attention neural networks
    Zhang, Haicang
    Xu, Michelle S.
    Fan, Xiao
    Chung, Wendy K.
    Shen, Yufeng
    [J]. NATURE MACHINE INTELLIGENCE, 2022, 4 (11) : 1017 - 1028
  • [9] Portrait photography splicing detection using ensemble of convolutional neural networks
    Revi, K. Remya
    Wilscy, M.
    Antony, Rahul
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (05) : 5347 - 5357
  • [10] Predicting Landslides Using Locally Aligned Convolutional Neural Networks
    Hajimoradlou, Ainaz
    Roberti, Gioachino
    Poole, David
    [J]. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3342 - 3348