Transcription Factor Binding Site Prediction Using CnNet Approach

被引:0
|
作者
Masood, M. Mohamed Divan [1 ]
Manjula, D. [2 ]
Sugumaran, Vijayan [3 ,4 ]
机构
[1] BS Abdur Rahman Crescent Inst Sci & Technol, Dept Comp Sci & Engn, Chennai 600048, India
[2] Vellore Inst Technol, Dept Comp Sci & Engn, Chennai 600127, India
[3] Oakland Univ, Dept Decis & Informat Sci, Rochester, MI 48309 USA
[4] Oakland Univ, Ctr Data Sci & Big Data Analyt, Rochester, MI 48309 USA
关键词
DNA; Hidden Markov models; Gene expression; Pulse width modulation; Proteins; Probes; Genetics; Motif discovery; transcription factor (TF) binding site; convolution neural network (CNN); multiple expression motifs for motif elicitation (MEME); sequence specificity; MOTIF DISCOVERY; GENE-EXPRESSION; DNA; SEQUENCE; ALGORITHM; STRATEGY; SEARCH;
D O I
10.1109/TCBB.2024.3411024
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Controlling the gene expression is the most important development in a living organism, which makes it easier to find different kinds of diseases and their causes. It's very difficult to know what factors control the gene expression. Transcription Factor (TF) is a protein that plays an important role in gene expression. Discovering the transcription factor has immense biological significance, however, it is challenging to develop novel techniques and evaluation for regulatory developments in biological structures. In this research, we mainly focus on 'sequence specificities' that can be ascertained from experimental data with 'deep learning' techniques, which offer a scalable, flexible and unified computational approach for predicting transcription factor binding. Specifically, Multiple Expression motifs for Motif Elicitation (MEME) technique with Convolution Neural Network (CNN) named as CnNet, has been used for discovering the 'sequence specificities' of DNA gene sequences dataset. This process involves two steps: a) discovering the motifs that are capable of identifying useful TF binding site by using MEME technique, and b) computing a score indicating the likelihood of a given sequence being a useful binding site by using CNN technique. The proposed CnNet approach predicts the TF binding score with much better accuracy compared to existing approaches.
引用
收藏
页码:1721 / 1730
页数:10
相关论文
共 50 条
  • [41] An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs
    Garcia-Alcalde, Fernando
    Blanco, Armando
    Shepherd, Adrian J.
    BMC BIOINFORMATICS, 2010, 11
  • [42] A Graphical Modelling Approach to the Dissection of Highly Correlated Transcription Factor Binding Site Profiles
    Stojnic, Robert
    Fu, Audrey Qiuyan
    Adryan, Boris
    PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (11)
  • [43] A sequential Monte Carlo EM approach to the transcription factor binding site identification problem
    Jackson, Edmund S.
    Fitzgerald, William J.
    BIOINFORMATICS, 2007, 23 (11) : 1313 - 1320
  • [44] An intuitionistic approach to scoring DNA sequences against transcription factor binding site motifs
    Fernando Garcia-Alcalde
    Armando Blanco
    Adrian J Shepherd
    BMC Bioinformatics, 11
  • [45] Definition and prediction of the full range of transcription factor binding sites - the hepatocyte nuclear factor 1 dimeric site
    Locker, J
    Ghosh, D
    Luc, PV
    Zheng, JH
    NUCLEIC ACIDS RESEARCH, 2002, 30 (17) : 3809 - 3817
  • [46] Structure-based prediction of transcription factor binding sites using a protein-DNA docking approach
    Liu, Zhijie
    Guo, Jun-Tao
    Li, Ting
    Xu, Ying
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 72 (04) : 1114 - 1124
  • [47] Knowledge-based three-body potential for transcription factor binding site prediction
    Qin, Wenyi
    Zhao, Guijun
    Carson, Matthew
    Jia, Caiyan
    Lu, Hui
    IET SYSTEMS BIOLOGY, 2016, 10 (01) : 23 - 29
  • [48] motifStack for the analysis of transcription factor binding site evolution
    Ou, Jianhong
    Wolfe, Scot A.
    Brodsky, Michael H.
    Zhu, Lihua Julie
    NATURE METHODS, 2018, 15 (01) : 8 - 9
  • [49] Evolutionary Origins of Transcription Factor Binding Site Clusters
    He, Xin
    Duque, Thyago S. P. C.
    Sinha, Saurabh
    MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (03) : 1059 - 1070
  • [50] POBO, transcription factor binding site verification with bootstrapping
    Kankainen, M
    Holm, L
    NUCLEIC ACIDS RESEARCH, 2004, 32 : W222 - W229