Transcription Factor Binding Site Prediction Using CnNet Approach

被引:0
|
作者
Masood, M. Mohamed Divan [1 ]
Manjula, D. [2 ]
Sugumaran, Vijayan [3 ,4 ]
机构
[1] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Sci & Engn, Chennai 600048, India
[2] Vellore Inst Technol, Dept Comp Sci & Engn, Chennai 600127, India
[3] Oakland Univ, Dept Decis & Informat Sci, Rochester, MI 48309 USA
[4] Oakland Univ, Ctr Data Sci & Big Data Analyt, Rochester, MI 48309 USA
关键词
DNA; Hidden Markov models; Gene expression; Pulse width modulation; Proteins; Probes; Genetics; Motif discovery; transcription factor (TF) binding site; convolution neural network (CNN); multiple expression motifs for motif elicitation (MEME); sequence specificity; MOTIF DISCOVERY; GENE-EXPRESSION; DNA; SEQUENCE; ALGORITHM; STRATEGY; SEARCH;
D O I
10.1109/TCBB.2024.3411024
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Controlling the gene expression is the most important development in a living organism, which makes it easier to find different kinds of diseases and their causes. It's very difficult to know what factors control the gene expression. Transcription Factor (TF) is a protein that plays an important role in gene expression. Discovering the transcription factor has immense biological significance, however, it is challenging to develop novel techniques and evaluation for regulatory developments in biological structures. In this research, we mainly focus on 'sequence specificities' that can be ascertained from experimental data with 'deep learning' techniques, which offer a scalable, flexible and unified computational approach for predicting transcription factor binding. Specifically, Multiple Expression motifs for Motif Elicitation (MEME) technique with Convolution Neural Network (CNN) named as CnNet, has been used for discovering the 'sequence specificities' of DNA gene sequences dataset. This process involves two steps: a) discovering the motifs that are capable of identifying useful TF binding site by using MEME technique, and b) computing a score indicating the likelihood of a given sequence being a useful binding site by using CNN technique. The proposed CnNet approach predicts the TF binding score with much better accuracy compared to existing approaches.
引用
收藏
页码:1721 / 1730
页数:10
相关论文
共 50 条
  • [31] Disentangling transcription factor binding site complexity
    Eggeling, Ralf
    NUCLEIC ACIDS RESEARCH, 2018, 46 (20)
  • [32] Dynamics of Transcription Factor Binding Site Evolution
    Tugrul, Murat
    Paixao, Tiago
    Barton, Nicholas H.
    Tkacik, Gasper
    PLOS GENETICS, 2015, 11 (11):
  • [33] Binding site of MraZ transcription factor in Mollicutes
    Fisunov, G. Y.
    Evsyutina, D. V.
    Semashko, T. A.
    Arzamasov, A. A.
    Manuvera, V. A.
    Letarov, A. V.
    Govorun, V. M.
    BIOCHIMIE, 2016, 125 : 59 - 65
  • [34] Prediction of transcription factor binding to DNA using rule induction methods
    Huss, Mikael
    Nordstrom, Karin
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2006, 3 (02) : 247 - 263
  • [35] Improving the predictive value of the competence transcription factor (ComK) binding site in Bacillus subtilis using a genomic approach
    Hamoen, LW
    Smits, WK
    de Jong, A
    Holsappel, S
    Kuipers, OP
    NUCLEIC ACIDS RESEARCH, 2002, 30 (24) : 5517 - 5528
  • [36] Probabilistic framework for transcription factor binding prediction
    Laehdesmaeki, Harri
    Shmulevich, Ilya
    2007 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS, 2007, : 95 - 98
  • [37] Integrated assessment and prediction of transcription factor binding
    Beyer, Andreas
    Workman, Christopher
    Hollunder, Jens
    Radke, Doerte
    Moeller, Ulrich
    Wilhelm, Thomas
    Ideker, Trey
    PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (06) : 615 - 626
  • [38] Transcription factor binding site identification using the self-organizing map
    Mahony, S
    Hendrix, D
    Golden, A
    Smith, TJ
    Rokhsar, DS
    BIOINFORMATICS, 2005, 21 (09) : 1807 - 1814
  • [39] Effect of Using Varying Negative Examples in Transcription Factor Binding Site Predictions
    Rezwan, Faisal
    Sun, Yi
    Davey, Neil
    Adams, Rod
    Rust, Alistair G.
    Robinson, Mark
    EVOLUTIONARY COMPUTATION, MACHINE LEARNING AND DATA MINING IN BIOINFORMATICS, 2011, 6623 : 1 - +
  • [40] Improved Models for Transcription Factor Binding Site Identification Using Nonindependent Interactions
    Zhao, Yue
    Ruan, Shuxiang
    Pandey, Manishi
    Stormo, Gary D.
    GENETICS, 2012, 191 (03) : 781 - U204