SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models

被引:1
|
作者
Wang, Yupeng [1 ,3 ]
Jaime-Lara, Rosario B. [2 ,3 ]
Roy, Abhrarup [3 ]
Sun, Ying [1 ]
Liu, Xinyue [1 ]
Joseph, Paule, V [2 ,3 ]
机构
[1] BDX Res & Consulting LLC, Herndon, VA 20171 USA
[2] NIAAA, Div Intramural Clin & Biol Res DICBR, NIH, Bethesda, MD 20892 USA
[3] NINR, Div Intramural Res, NIH, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
Enhancer; Classification; Deep learning; DNA sequence; Cell type;
D O I
10.1186/s13104-021-05518-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
ObjectiveTo address the challenge of computational identification of cell type-specific regulatory elements on a genome-wide scale.ResultsWe propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of "strong enhancer" chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, positional k-mer (k=5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences across each nucleotide position were used as features for deep learning models. Three deep learning models were implemented, including multi-layer perceptron (MLP), Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers (including gkm-SVM and DanQ) in distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL can directly discriminate enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue-specificity can be accurately identified based on their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL.
引用
收藏
页数:7
相关论文
共 50 条
  • [41] Sequence-Based Prediction of Cysteine Reactivity Using Machine Learning
    Wang, Haobo
    Chen, Xuemin
    Li, Can
    Liu, Yuan
    Yang, Fan
    Wang, Chu
    BIOCHEMISTRY, 2018, 57 (04) : 451 - 460
  • [42] Sequence-Based Prediction of Plant Allergenic Proteins: Machine Learning Classification Approach
    Nedyalkova, Miroslava
    Vasighi, Mahdi
    Azmoon, Amirreza
    Naneva, Ludmila
    Simeonov, Vasil
    ACS OMEGA, 2023, : 3698 - 3704
  • [43] Evaluation of Cross-Validation Strategies in Sequence-Based Binding Prediction-Using Deep Learning
    Lopez-del Rio, Angela
    None-Canals, Alfons
    Vidal, David
    Perera-Lluna, Alexandre
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2019, 59 (04) : 1645 - 1657
  • [44] Predicting subcellular location of protein with evolution information and sequence-based deep learning
    Liao, Zhijun
    Pan, Gaofeng
    Sun, Chao
    Tang, Jijun
    BMC Bioinformatics, 2021, 22
  • [45] Sequence-Based Prediction of Food-Originated ACE Inhibitory Peptides Using Deep Learning Algorithm
    Terziyska, Margarita
    Desseva, Ivelina
    Terziyski, Zhelyazko
    CONTEMPORARY METHODS IN BIOINFORMATICS AND BIOMEDICINE AND THEIR APPLICATIONS, 2022, 374 : 236 - 246
  • [46] Deep Gesture Generation for Social Robots Using Type-Specific Libraries
    Teshima, Hitoshi
    Wake, Naoki
    Thomas, Diego
    Nakashima, Yuta
    Kawasaki, Hiroshi
    Ikeuchi, Katsushi
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8286 - 8291
  • [47] Automated Brain Disease Classification using Transfer Learning based Deep Learning Models
    Alam, Farhana
    Tisha, Farhana Chowdhury
    Rahman, Sara Anisa
    Sultana, Samia
    Chowdhury, Md. Ahied Mahi
    Reza, Ahmed Wasif
    Shamsul, Mohammad
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 941 - 949
  • [48] Discovery of novel small molecule cell type-specific enhancers of NF-κB nuclear translocation
    Gong, Gangli
    Xie, Yuli
    Liu, Yidong
    Rinderspacher, Alison
    Deng, Shi-Xian
    Feng, Yan
    Zhu, Zhengxiang
    Tang, Yufei
    Wyler, Michael
    Aulner, Nathalie
    Toebben, Udo
    Smith, Deborah H.
    Branden, Lars
    Chung, Caty
    Schrer, Stephan
    Vidovic, Dusica
    Landry, Donald W.
    BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2009, 19 (04) : 1191 - 1194
  • [49] Long non-coding RNAs direct the SWI/SNF complex to cell type-specific enhancers
    James A. Oo
    Timothy Warwick
    Katalin Pálfi
    Frederike Lam
    Francois McNicoll
    Cristian Prieto-Garcia
    Stefan Günther
    Can Cao
    Yinuo Zhou
    Alexey A. Gavrilov
    Sergey V. Razin
    Alfredo Cabrera-Orefice
    Ilka Wittig
    Soni Savai Pullamsetti
    Leo Kurian
    Ralf Gilsbach
    Marcel H. Schulz
    Ivan Dikic
    Michaela Müller-McNicoll
    Ralf P. Brandes
    Matthias S. Leisegang
    Nature Communications, 16 (1)
  • [50] Sequence-based peptide identification, generation, and property prediction with deep learning: a review
    Chen, Xumin
    Li, Chen
    Bernards, Matthew T.
    Shi, Yao
    Shao, Qing
    He, Yi
    MOLECULAR SYSTEMS DESIGN & ENGINEERING, 2021, 6 (06): : 406 - 428