SeqEnhDL: sequence-based classification of cell type-specific enhancers using deep learning models

Cited by: 1
Authors
Wang, Yupeng [1, 3]
Jaime-Lara, Rosario B. [2, 3]
Roy, Abhrarup [3 ]
Sun, Ying [1 ]
Liu, Xinyue [1 ]
Joseph, Paule V. [2, 3]
Affiliations
[1] BDX Res & Consulting LLC, Herndon, VA 20171 USA
[2] NIAAA, Div Intramural Clin & Biol Res DICBR, NIH, Bethesda, MD 20892 USA
[3] NINR, Div Intramural Res, NIH, Bethesda, MD 20892 USA
Funding
National Institutes of Health (NIH), USA
Keywords
Enhancer; Classification; Deep learning; DNA sequence; Cell type
DOI
10.1186/s13104-021-05518-7
Chinese Library Classification (CLC) number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Objective: To address the challenge of computationally identifying cell type-specific regulatory elements on a genome-wide scale.

Results: We propose SeqEnhDL, a deep learning framework for classifying cell type-specific enhancers based on sequence features. DNA sequences of "strong enhancer" chromatin states in nine cell types from the ENCODE project were retrieved to build and test enhancer classifiers. For any DNA sequence, positional k-mer (k = 5, 7, 9 and 11) fold changes relative to randomly selected non-coding sequences across each nucleotide position were used as features for deep learning models. Three deep learning models were implemented: a multi-layer perceptron (MLP), a convolutional neural network (CNN) and a recurrent neural network (RNN). All models in SeqEnhDL outperform state-of-the-art enhancer classifiers (including gkm-SVM and DanQ) in distinguishing cell type-specific enhancers from randomly selected non-coding sequences. Moreover, SeqEnhDL can directly discriminate between enhancers from different cell types, which has not been achieved by other enhancer classifiers. Our analysis suggests that both enhancers and their tissue specificity can be accurately identified from their sequence features. SeqEnhDL is publicly available at https://github.com/wyp1125/SeqEnhDL.
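
To make the feature described in the abstract more concrete, the following is a minimal sketch of one way a positional k-mer fold change could be computed. It is not taken from the SeqEnhDL code: the function names, the pseudocount, the use of log2 and the toy sequences are all assumptions made for illustration, and the abstract does not specify the exact formula.

```python
from collections import Counter
from math import log2


def positional_kmer_counts(seqs, k):
    """Count every k-mer at every start position across equal-length sequences."""
    n_pos = len(seqs[0]) - k + 1
    counts = [Counter() for _ in range(n_pos)]
    for s in seqs:
        for p in range(n_pos):
            counts[p][s[p:p + k]] += 1
    return counts


def make_fold_change_lookup(enh_seqs, bg_seqs, k, pseudo=1.0):
    """Return a function giving the log2 fold change of a k-mer at a position.

    Assumption: fold change = frequency among enhancers / frequency among
    background sequences, smoothed with a pseudocount to avoid division by zero.
    """
    enh = positional_kmer_counts(enh_seqs, k)
    bg = positional_kmer_counts(bg_seqs, k)
    n_enh, n_bg = len(enh_seqs), len(bg_seqs)

    def lookup(pos, kmer):
        f_enh = (enh[pos][kmer] + pseudo) / (n_enh + pseudo)
        f_bg = (bg[pos][kmer] + pseudo) / (n_bg + pseudo)
        return log2(f_enh / f_bg)

    return lookup


def featurize(seq, lookup, k):
    """Map one sequence to a vector with one fold-change value per position."""
    return [lookup(p, seq[p:p + k]) for p in range(len(seq) - k + 1)]


# Toy data (hypothetical, far shorter than real enhancer sequences).
enh = ["ACGTA", "ACGTT"]
bg = ["TTTTT", "ACGTA"]
lookup = make_fold_change_lookup(enh, bg, k=3)
feats = featurize("ACGTA", lookup, k=3)
```

In this sketch a positive value means the k-mer is more frequent at that position among the enhancer sequences than among the background sequences. Repeating the computation for several values of k would give one feature vector per k, which could then be passed to a downstream classifier.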
Pages: 7