pysster: classification of biological sequences by learning sequence and structure motifs with convolutional neural networks

被引:64
|
作者
Budach, Stefan [1 ]
Marsico, Annalisa [1 ,2 ]
机构
[1] Max Planck Inst Mol Genet, Otto Warburg Lab, RNA Bioinformat, D-14195 Berlin, Germany
[2] Free Univ Berlin, Dept Math & Comp Sci, D-14195 Berlin, Germany
关键词
DNA;
D O I
10.1093/bioinformatics/bty222
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The Summary: Convolutional neural networks (CNNs) have been shown to perform exceptionally well in a variety of tasks, including biological sequence classification. Available implementations, however, are usually optimized for a particular task and difficult to reuse. To enable researchers to utilize these networks more easily, we implemented pysster, a Python package for training CNNs on biological sequence data. Sequences are classified by learning sequence and structure motifs and the package offers an automated hyper-parameter optimization procedure and options to visualize learned motifs along with information about their positional and class enrichment. The package runs seamlessly on CPU and GPU and provides a simple interface to train and evaluate a network with a handful lines of code. Using an RNA A-to-I editing dataset and cross-linking immunoprecipitation (CLIP)-seq binding site sequences, we demonstrate that pysster classifies sequences with higher accuracy than previous methods, such as GraphProt or ssHMM, and is able to recover known sequence and structure motifs.
引用
收藏
页码:3035 / 3037
页数:3
相关论文
共 50 条
  • [11] Sequence to Sequence ECG Cardiac Rhythm Classification Using Convolutional Recurrent Neural Networks
    Pokaprakarn, Teeranan
    Kitzmiller, Rebecca R.
    Moorman, J. Randall
    Lake, Doug E.
    Krishnamurthy, Ashok K.
    Kosorok, Michael R.
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (02) : 572 - 580
  • [12] Convolutional neural networks and extreme learning machines for malware classification
    Mugdha Jain
    William Andreopoulos
    Mark Stamp
    Journal of Computer Virology and Hacking Techniques, 2020, 16 : 229 - 244
  • [13] Learning Sparse Features in Convolutional Neural Networks for Image Classification
    Luo, Wei
    Li, Jun
    Xu, Wei
    Yang, Jian
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: IMAGE AND VIDEO DATA ENGINEERING, ISCIDE 2015, PT I, 2015, 9242 : 29 - 38
  • [14] Active Learning Strategies and Convolutional Neural Networks for Mammogram Classification
    Tozato, Joao Marcelo
    Bugatti, Pedro Henrique
    Maeda Saito, Priscila Tiemi
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING (ICAISC 2021), PT II, 2021, 12855 : 126 - 134
  • [15] Convolutional neural networks and extreme learning machines for malware classification
    Jain, Mugdha
    Andreopoulos, William
    Stamp, Mark
    JOURNAL OF COMPUTER VIROLOGY AND HACKING TECHNIQUES, 2020, 16 (03) : 229 - 244
  • [16] Convolutional neural networks for classification of alignments of non-coding RNA sequences
    Aoki, Genta
    Sakakibara, Yasubumi
    BIOINFORMATICS, 2018, 34 (13) : 237 - 244
  • [17] CLASSIFICATION OF SEVERELY OCCLUDED IMAGE SEQUENCES VIA CONVOLUTIONAL RECURRENT NEURAL NETWORKS
    Zheng, Jian
    Wang, Yifan
    Zhang, Xiaonan
    Li, Xiaohua
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 395 - 399
  • [18] Interpreting neural networks for biological sequences by learning stochastic masks
    Johannes Linder
    Alyssa La Fleur
    Zibo Chen
    Ajasja Ljubetič
    David Baker
    Sreeram Kannan
    Georg Seelig
    Nature Machine Intelligence, 2022, 4 : 41 - 54
  • [19] Interpreting neural networks for biological sequences by learning stochastic masks
    Linder, Johannes
    La Fleur, Alyssa
    Chen, Zibo
    Ljubetic, Ajasja
    Baker, David
    Kannan, Sreeram
    Seelig, Georg
    NATURE MACHINE INTELLIGENCE, 2022, 4 (01) : 41 - +
  • [20] Identification of Traditional Motifs using Convolutional Neural Networks
    Jurj, Sorin Liviu
    Opritoiu, Flavius
    Vladutiu, Mircea
    2018 IEEE 24TH INTERNATIONAL SYMPOSIUM FOR DESIGN AND TECHNOLOGY IN ELECTRONIC PACKAGING (SIITME), 2018, : 191 - 196