Predicting Transcription Factor Binding Sites with Deep Learning

被引:0
|
作者
Ghosh, Nimisha [1 ]
Santoni, Daniele [2 ]
Saha, Indrajit [3 ]
Felici, Giovanni [2 ]
机构
[1] Siksha O Anusandhan Univ, Inst Tech Educ & Res, Dept Comp Sci & Informat Technol, Bhubaneswar 751030, India
[2] Natl Res Council Italy, Inst Syst Anal & Comp Sci Antonio Ruberti, I-00185 Rome, Italy
[3] Natl Inst Tech Teachers Training & Res, Dept Comp Sci & Engn, Kolkata 700106, India
关键词
capsule network; deep learning; DNA sequences; transcription factor binding sites (TFBSs); CAPSULE NETWORK; DNA; PROTEINS; IDENTIFICATION; VARIANTS;
D O I
10.3390/ijms25094990
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Prediction of binding sites for transcription factors is important to understand how the latter regulate gene expression and how this regulation can be modulated for therapeutic purposes. A consistent number of references address this issue with different approaches, Machine Learning being one of the most successful. Nevertheless, we note that many such approaches fail to propose a robust and meaningful method to embed the genetic data under analysis. We try to overcome this problem by proposing a bidirectional transformer-based encoder, empowered by bidirectional long-short term memory layers and with a capsule layer responsible for the final prediction. To evaluate the efficiency of the proposed approach, we use benchmark ChIP-seq datasets of five cell lines available in the ENCODE repository (A549, GM12878, Hep-G2, H1-hESC, and Hela). The results show that the proposed method can predict TFBS within the five different cell lines very well; moreover, cross-cell predictions provide satisfactory results as well. Experiments conducted across cell lines are reinforced by the analysis of five additional lines used only to test the model trained using the others. The results confirm that prediction across cell lines remains very high, allowing an extensive cross-transcription factor analysis to be performed from which several indications of interest for molecular biology may be drawn.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] Predicting transcription factor binding sites using DNA shape features based on shared hybrid deep learning architecture
    Wang, Siguo
    Zhang, Qinhu
    Shen, Zhen
    He, Ying
    Chen, Zhen-Heng
    Li, Jianqiang
    Huang, De-Shuang
    MOLECULAR THERAPY NUCLEIC ACIDS, 2021, 24 : 154 - 163
  • [12] A deep learning model for predicting transcription factor binding location at Single Nucleotide Resolution
    Salekin, Sirajul
    Zhang, Jianqiu
    Huang, Yufei
    2017 IEEE EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL & HEALTH INFORMATICS (BHI), 2017, : 57 - 60
  • [13] CiiiDER: A tool for predicting and analysing transcription factor binding sites
    Gearing, Linden J.
    Cumming, Helen E.
    Chapman, Ross
    Finkel, Alexander M.
    Woodhouse, Isaac B.
    Luu, Kevin
    Gould, Jodee A.
    Forster, Samuel C.
    Hertzog, Paul J.
    PLOS ONE, 2019, 14 (09):
  • [14] Predicting transcription factor binding sites using structural knowledge
    Kaplan, T
    Friedman, N
    Margalit, H
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, PROCEEDINGS, 2005, 3500 : 522 - 537
  • [15] Recurrent Neural Network for Predicting Transcription Factor Binding Sites
    Zhen Shen
    Wenzheng Bao
    De-Shuang Huang
    Scientific Reports, 8
  • [16] Recurrent Neural Network for Predicting Transcription Factor Binding Sites
    Shen, Zhen
    Bao, Wenzheng
    Huang, De-Shuang
    SCIENTIFIC REPORTS, 2018, 8
  • [17] DeepSTF: predicting transcription factor binding sites by interpretable deep neural networks combining sequence and shape
    Ding, Pengju
    Wang, Yifei
    Zhang, Xinyu
    Gao, Xin
    Liu, Guozhu
    Yu, Bin
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (04)
  • [18] Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning
    Jing, Fang
    Zhang, Shao-Wu
    Cao, Zhen
    Zhang, Shihua l
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018, 2018, 10847 : 241 - 252
  • [19] Prediction of Transcription Factor Binding Sites on Cell-Free DNA Based on Deep Learning
    Qi, Ting
    Zhou, Ying
    Sheng, Yuqi
    Li, Zhihui
    Yang, Yuwei
    Liu, Quanjun
    Ge, Qinyu
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2024, 64 (10) : 4002 - 4008
  • [20] Toward an atomistic model for predicting transcription-factor binding sites
    Endres, RG
    Schulthess, TC
    Wingreen, NS
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (02) : 262 - 268