Transcription Factor Binding Site Prediction Using CnNet Approach

被引:0
|
作者
Masood, M. Mohamed Divan [1 ]
Manjula, D. [2 ]
Sugumaran, Vijayan [3 ,4 ]
机构
[1] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Sci & Engn, Chennai 600048, India
[2] Vellore Inst Technol, Dept Comp Sci & Engn, Chennai 600127, India
[3] Oakland Univ, Dept Decis & Informat Sci, Rochester, MI 48309 USA
[4] Oakland Univ, Ctr Data Sci & Big Data Analyt, Rochester, MI 48309 USA
关键词
DNA; Hidden Markov models; Gene expression; Pulse width modulation; Proteins; Probes; Genetics; Motif discovery; transcription factor (TF) binding site; convolution neural network (CNN); multiple expression motifs for motif elicitation (MEME); sequence specificity; MOTIF DISCOVERY; GENE-EXPRESSION; DNA; SEQUENCE; ALGORITHM; STRATEGY; SEARCH;
D O I
10.1109/TCBB.2024.3411024
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Controlling the gene expression is the most important development in a living organism, which makes it easier to find different kinds of diseases and their causes. It's very difficult to know what factors control the gene expression. Transcription Factor (TF) is a protein that plays an important role in gene expression. Discovering the transcription factor has immense biological significance, however, it is challenging to develop novel techniques and evaluation for regulatory developments in biological structures. In this research, we mainly focus on 'sequence specificities' that can be ascertained from experimental data with 'deep learning' techniques, which offer a scalable, flexible and unified computational approach for predicting transcription factor binding. Specifically, Multiple Expression motifs for Motif Elicitation (MEME) technique with Convolution Neural Network (CNN) named as CnNet, has been used for discovering the 'sequence specificities' of DNA gene sequences dataset. This process involves two steps: a) discovering the motifs that are capable of identifying useful TF binding site by using MEME technique, and b) computing a score indicating the likelihood of a given sequence being a useful binding site by using CNN technique. The proposed CnNet approach predicts the TF binding score with much better accuracy compared to existing approaches.
引用
收藏
页码:1721 / 1730
页数:10
相关论文
共 50 条
  • [1] A Systems Biology Approach to Transcription Factor Binding Site Prediction
    Zhou, Xiang
    Sumazin, Pavel
    Rajbhandari, Presha
    Califano, Andrea
    PLOS ONE, 2010, 5 (03):
  • [2] Scoring functions for transcription factor binding site prediction
    Markus Friberg
    Peter von Rohr
    Gaston Gonnet
    BMC Bioinformatics, 6
  • [3] A web server for transcription factor binding site prediction
    Su, Gang
    Mao, Binchen
    Wang, Jin
    BIOINFORMATION, 2006, 1 (05) : 156 - 157
  • [4] The Next Generation of Transcription Factor Binding Site Prediction
    Mathelier, Anthony
    Wasserman, Wyeth W.
    PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (09)
  • [5] Evaluating tools for transcription factor binding site prediction
    Jayaram N.
    Usvyat D.
    Martin A.C.
    BMC Bioinformatics, 17 (1)
  • [6] Scoring functions for transcription factor binding site prediction
    Friberg, M
    von Rohr, P
    Gonnet, G
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [7] Robust Transcription Factor Binding Site Prediction Using Deep Neural Networks
    Geete, Kanu
    Pandey, Manish
    CURRENT BIOINFORMATICS, 2020, 15 (10) : 1137 - 1152
  • [8] Enhancing the interpretability of transcription factor binding site prediction using attention mechanism
    Sungjoon Park
    Yookyung Koh
    Hwisang Jeon
    Hyunjae Kim
    Yoonsun Yeo
    Jaewoo Kang
    Scientific Reports, 10
  • [9] Enhancing the interpretability of transcription factor binding site prediction using attention mechanism
    Park, Sungjoon
    Koh, Yookyung
    Jeon, Hwisang
    Kim, Hyunjae
    Yeo, Yoonsun
    Kang, Jaewoo
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [10] A biophysical approach to transcription factor binding site discovery
    Djordjevic, M
    Sengupta, AM
    Shraiman, BI
    GENOME RESEARCH, 2003, 13 (11) : 2381 - 2390