NCYPred: A Bidirectional LSTM Network With Attention for Y RNA and Short Non-Coding RNA Classification

被引:4
|
作者
Lima, Diego de S. [1 ]
Amichi, Luiz J. A. [2 ]
Fernandez, Maria A. [3 ]
Constantino, Ademir A. [2 ]
V. Seixas, Flavio A. [1 ]
机构
[1] Univ Estadual Maringa, Dept Techonol, BR-87506370 Umuarama, Parana, Brazil
[2] Univ Estadual Maringa, Dept Informat, BR-87020900 Maringa, Parana, Brazil
[3] Univ Estadual Maringa, Dept Biotechnol Genet & Cell Biol, BR-87020900 Maringa, Parana, Brazil
关键词
Non-coding RNA; Y RNA; recurrent neural network; sequence classification; web server; EXPRESSION; MECHANISM;
D O I
10.1109/TCBB.2021.3131136
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Short non-coding RNAs (sncRNAs) are involved in multiple cellular processes and can be divided into dozens of classes. Among such classes, Y RNAs have been gaining attention, being essential factors for the initiation of DNA replication on vertebrates, as well as potential tumor biomarkers. Homologs have also been described in nematodes and insects, as well as related sequences in bacteria. Methods capable of accurately predicting Y RNA transcripts are lacking. In this work, we developed an attention-based LSTM network and built a classification model able to classify sncRNAs (including Y RNA) directly from nucleotide sequences. A dataset consisting of 45,447 sncRNA sequences, from a wide range of organisms, obtained from Rfam 14.3 was built. Performance evaluation demonstrated that our proposed method, NCYPred (Non-Coding/Y RNA Prediction), can accurately predict Y RNA sequences and their homologs, as well as 11 additional classes, achieving results comparable with state-of-the-art methods. We also demonstrate that applying t-SNE on learned sequence representations could be useful for sequence analysis. Our model is freely available as a web-server (https://www.gpea.uem.br/ncypred/).
引用
收藏
页码:557 / 565
页数:9
相关论文
共 50 条
  • [1] Class similarity network for coding and long non-coding RNA classification
    Zhang, Yu
    Long, Yahui
    Kwoh, Chee Keong
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [2] Class similarity network for coding and long non-coding RNA classification
    Yu Zhang
    Yahui Long
    Chee Keong Kwoh
    BMC Bioinformatics, 22
  • [3] ncRDeep: Non-coding RNA classification with convolutional neural network
    Chantsalnyam, Tuvshinbayar
    Lim, Dae Yeong
    Tayara, Hilal
    Chong, Kil To
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2020, 88
  • [4] Short Non-Coding RNA and Antidepressant Response
    Turecki, Gustavo
    NEUROPSYCHOPHARMACOLOGY, 2017, 42 : S18 - S18
  • [5] The role of non-coding RNA network in atherosclerosis
    Yuan, Yuchan
    Xu, Ling
    Geng, Zhaohong
    Liu, Jingjing
    Zhang, Lijiao
    Wu, Yuhang
    He, Dan
    Qu, Peng
    LIFE SCIENCES, 2021, 265
  • [6] Non-coding RNA
    Mattick, JS
    Makunin, IV
    HUMAN MOLECULAR GENETICS, 2006, 15 : R17 - R29
  • [7] Mechanistic insights into non-coding Y RNA processing
    Billmeier, Martina
    Green, Darrell
    Hall, Adam E.
    Turnbull, Carly
    Singh, Archana
    Xu, Ping
    Moxon, Simon
    Dalmay, Tamas
    RNA BIOLOGY, 2022, 19 (01) : 468 - 480
  • [8] Evaluation of deep learning in non-coding RNA classification
    Amin, Noorul
    McGrath, Annette
    Chen, Yi-Ping Phoebe
    NATURE MACHINE INTELLIGENCE, 2019, 1 (05) : 246 - 256
  • [9] Pattern recognition for the classification of long non-coding RNA
    Eubanks, Christopher
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 251
  • [10] Evaluation of deep learning in non-coding RNA classification
    Noorul Amin
    Annette McGrath
    Yi-Ping Phoebe Chen
    Nature Machine Intelligence, 2019, 1 : 246 - 256