Machine learning study of DNA binding by transcription factors from the LacI family

被引:2
|
作者
Fedonin, G. G. [1 ]
Rakhmaninova, A. B. [2 ]
Korostelev, Yu D. [2 ]
Laikova, O. N. [1 ]
Gelfand, M. S. [1 ]
机构
[1] Kharkevich Inst, Inst Informat Transmiss Problems, Moscow 127994, Russia
[2] Moscow MV Lomonosov State Univ, Dept Bioengn & Bioinformat, Moscow 119991, Russia
基金
俄罗斯基础研究基金会;
关键词
transcription factors; Naive Bayes classifier; Logistic Regression; Mutual Information; prokaryotes; LacI family; SPECIFICITY-DETERMINING RESIDUES; FUNCTIONAL SPECIFICITY; MUTUAL INFORMATION; PROTEINS; RECOGNITION; PREDICTION; SITES; SELECTION; SEQUENCE; TOOL;
D O I
10.1134/S0026893311040054
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We studied 1372 LacI-family transcription factors and their 4484 DNA binding sites using machine learning algorithms and feature selection techniques. The Naive Bayes classifier and Logistic Regression were used to predict binding sites given transcription factor sequences and to classify factor-site pairs on binding and non-binding ones. Prediction accuracy was estimated using 10-fold cross-validation. Experiments showed that the best prediction of nucleotide densities at selected site positions is obtained using only a few key protein sequence positions. These positions are stably selected by the forward feature selection based on the mutual information of factor-site position pairs.
引用
收藏
页码:667 / 679
页数:13
相关论文
共 50 条
  • [1] Machine learning study of DNA binding by transcription factors from the LacI family
    G. G. Fedonin
    A. B. Rakhmaninova
    Yu. D. Korostelev
    O. N. Laikova
    M. S. Gelfand
    [J]. Molecular Biology, 2011, 45 : 667 - 679
  • [2] Machine Learning Study of DNA Binding by Transcription Factors from the LacI Family
    Fedonin, Gennady G.
    Gelfand, Mikhail S.
    [J]. PATTERN RECOGNITION IN BIOINFORMATICS, 2010, 6282 : 15 - 26
  • [3] Deciphering the enigma of missing DNA binding domain of LacI family transcription factors
    Neetu, Neetu
    Katiki, Madhusudhanarao
    Mahto, Jai Krishna
    Sharma, Monica
    Narayanan, Anoop
    Maity, Sudipa
    Tomar, Shailly
    Ambatipudi, Kiran
    Sharma, Ashwani Kumar
    Yernool, Dinesh
    Kumar, Pravindra
    [J]. ARCHIVES OF BIOCHEMISTRY AND BIOPHYSICS, 2021, 713
  • [4] DNAffinity: a machine-learning approach to predict DNA binding affinities of transcription factors
    Barissi, Sandro
    Sala, Alba
    Wieczor, Milosz
    Battistini, Federica
    Orozco, Modesto
    [J]. NUCLEIC ACIDS RESEARCH, 2022, 50 (16) : 9105 - 9114
  • [5] Identifying Common Structural DNA Properties in Transcription Factor Binding Site Sets of the LacI-GalR Family
    Meysman, Pieter
    Marchal, Kathleen
    Engelen, Kristof
    [J]. CURRENT BIOINFORMATICS, 2013, 8 (04) : 483 - 488
  • [6] DNA-BINDING SPECIFICITY OF GATA FAMILY TRANSCRIPTION FACTORS
    MERIKA, M
    ORKIN, SH
    [J]. MOLECULAR AND CELLULAR BIOLOGY, 1993, 13 (07) : 3999 - 4010
  • [7] Comparative Genomics of CytR, an Unusual Member of the LacI Family of Transcription Factors
    Sernova, Natalia V.
    Gelfand, Mikhail S.
    [J]. PLOS ONE, 2012, 7 (09):
  • [8] Machine learning predicts nucleosome binding modes of transcription factors
    Kishan, K. C.
    Subramanya, Sridevi K.
    Li, Rui
    Cui, Feng
    [J]. BMC BIOINFORMATICS, 2021, 22 (01)
  • [9] Machine learning predicts nucleosome binding modes of transcription factors
    K. C. Kishan
    Sridevi K. Subramanya
    Rui Li
    Feng Cui
    [J]. BMC Bioinformatics, 22
  • [10] A structural analysis of the bZIP family of transcription factors: Dimerization, DNA binding and interactions with other transcription factors
    Glover, JNM
    Chen, L
    Verdine, GL
    Harrison, SC
    [J]. BIOLOGICAL STRUCTURE AND DYNAMICS, VOL 1, 1996, : 23 - 33