Principal component analysis for predicting transcription-factor binding motifs from array-derived data

Cited by: 8
Authors
Liu, YL
Vincenti, MP
Yokota, H [1]
Affiliations
[1] Indiana Univ Purdue Univ, Dept Biomed Engn, Indianapolis, IN 46202 USA
[2] Purdue Univ, Weldon Sch Biomed Engn, W Lafayette, IN 47907 USA
[3] Indiana Univ Purdue Univ, Dept Anat & Cell Biol, Indianapolis, IN 46202 USA
[4] Dept Vet Affairs, White River Jct, VT 05009 USA
[5] Dartmouth Coll Sch Med, Dept Med, Hanover, NH 03755 USA
Keywords
DOI
10.1186/1471-2105-6-276
Chinese Library Classification (CLC) number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Background: The responses to interleukin 1 (IL-1) in human chondrocytes constitute a complex regulatory mechanism in which multiple transcription factors interact combinatorially with transcription-factor binding motifs (TFBMs). Selecting a critical set of TFBMs from genomic DNA information and array-derived data requires an efficient algorithm for solving a combinatorial optimization problem. Although computational approaches based on evolutionary algorithms are commonly employed, an analytical algorithm would be useful for predicting TFBMs at almost no computational cost and for evaluating varying modelling conditions. Singular value decomposition (SVD) is a powerful method for deriving the primary components of a given matrix. By applying SVD to a promoter matrix defined from regulatory DNA sequences, we derived a novel method to predict the critical set of TFBMs.

Results: The promoter matrix was defined to establish a quantitative relationship between the IL-1-driven mRNA alteration and the genomic DNA sequences of the IL-1 responsive genes. The matrix was decomposed with SVD, and the effects of 8 potential TFBMs (5'-CAGGC-3', 5'-CGCCC-3', 5'-CCGCC-3', 5'-ATGGG-3', 5'-GGGAA-3', 5'-CGTCC-3', 5'-AAAGG-3', and 5'-ACCCA-3') were predicted from a pool of 512 random DNA sequences. The prediction included matches to the core binding motifs of biologically known TFBMs such as AP2, SP1, EGR1, KROX, GC-BOX, ABI4, ETF, E2F, SRF, STAT, IK-1, PPAR., STAF, ROAZ, and NF-κB, and their significance was evaluated numerically using Monte Carlo simulation and a genetic algorithm.

Conclusion: The described SVD-based prediction is an analytical method that provides a set of potential TFBMs involved in transcriptional regulation. The results would be useful for analytically evaluating the contribution of individual DNA sequences.
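The idea in the abstract, relating motif occurrences in promoters to expression changes through an SVD of a promoter matrix, can be illustrated with a minimal Python sketch. It is not the authors' procedure. It assumes that the pool of 512 sequences corresponds to the 5-mers that remain after merging each sequence with its reverse complement (4^5 / 2 = 512), that the promoter matrix holds motif counts per gene, and that motif effects are estimated as a truncated-SVD least-squares solution. The function names, the `rank` parameter, and the toy data are all hypothetical.

```python
from itertools import product

import numpy as np

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def canonical_kmers(k=5):
    """All k-mers, keeping one representative per reverse-complement pair."""
    seen = set()
    for letters in product("ACGT", repeat=k):
        kmer = "".join(letters)
        seen.add(min(kmer, reverse_complement(kmer)))
    return sorted(seen)


def promoter_matrix(promoters, motifs):
    """Rows are genes, columns are motifs; entries count hits on either strand."""
    k = len(motifs[0])
    index = {m: j for j, m in enumerate(motifs)}
    H = np.zeros((len(promoters), len(motifs)))
    for i, seq in enumerate(promoters):
        for p in range(len(seq) - k + 1):
            window = seq[p:p + k]
            key = min(window, reverse_complement(window))
            if key in index:  # windows with other characters are skipped
                H[i, index[key]] += 1
    return H


def svd_motif_effects(H, z, rank):
    """Least-squares estimate of x in H x = z using the top singular triplets."""
    U, s, Vt = np.linalg.svd(H, full_matrices=False)
    r = min(rank, int(np.sum(s > 1e-10)))  # drop numerically zero directions
    return Vt[:r].T @ ((U[:, :r].T @ z) / s[:r])


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    motifs = canonical_kmers(5)
    # Toy stand-ins: random promoters and random expression changes.
    promoters = ["".join(rng.choice(list("ACGT"), size=200)) for _ in range(20)]
    z = rng.normal(size=20)
    H = promoter_matrix(promoters, motifs)
    x = svd_motif_effects(H, z, rank=8)
    top = np.argsort(-np.abs(x))[:8]
    print([motifs[j] for j in top])
```

With real data, `promoters` would hold the regulatory sequences of the IL-1 responsive genes and `z` the measured mRNA changes; the motifs with the largest absolute entries in `x` would be the candidates to compare against known TFBMs.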
Pages: 12
Related papers
50 in total
  • [1] Principal component analysis for predicting transcription-factor binding motifs from array-derived data
    Yunlong Liu
    Matthew P Vincenti
    Hiroki Yokota
    BMC Bioinformatics, 6
  • [2] Modelling and identification of transcription-factor binding motifs in human chondrogenesis
    Unknown
    SYSTEMS BIOLOGY, 2004, 1 (01): : 85 - 92
  • [3] Toward an atomistic model for predicting transcription-factor binding sites
    Endres, RG
    Schulthess, TC
    Wingreen, NS
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (02) : 262 - 268
  • [4] Predicting transcription factor binding motifs from DNA-binding domains, chromatin accessibility and gene expression data
    Zamanighomi, Mahdi
    Lin, Zhixiang
    Wang, Yong
    Jiang, Rui
    Wong, Wing Hung
    NUCLEIC ACIDS RESEARCH, 2017, 45 (10) : 5666 - 5677
  • [5] Model-based Comparative Prediction of Transcription-Factor Binding Motifs in Anabolic Responses in Bone
    Andy B. Chen
    Kazunori Hamamura
    Subburaman Mohan
    Hiroki Yokota
    Genomics Proteomics & Bioinformatics, 2007, (Z1) : 158 - 165
  • [6] FROM BINDING MOTIFS IN CHIP-SEQ DATA TO IMPROVED MODELS OF TRANSCRIPTION FACTOR BINDING SITES
    Kulakovskiy, Ivan
    Levitsky, Victor
    Oshchepkov, Dmitry
    Bryzgalov, Leonid
    Vorontsov, Ilya
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2013, 11 (01)
  • [7] MEDEA: analysis of transcription factor binding motifs in accessible chromatin
    Mariani, Luca
    Weinand, Kathryn
    Gisselbrecht, Stephen S.
    Bulyk, Martha L.
    GENOME RESEARCH, 2020, 30 (05) : 736 - 748
  • [8] Analysis of transcription-factor binding-site evolution by using the Hamilton-Jacobi equations
    Mark Ancliff
    Jeong-Man Park
    Journal of the Korean Physical Society, 2016, 69 : 1711 - 1719
  • [9] INVESTIGATION INTO RESULTS OF PRINCIPAL COMPONENT ANALYSIS OF DATA DERIVED FROM RANDOM NUMBERS
    FARMER, SA
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1971, 20 (04) : 63 - 72
  • [10] Principal Component Analysis and Factor Analysis for an Atanassov IF Data Set
    Duris, Viliam
    Bartkova, Renata
    Tirpakova, Anna
    MATHEMATICS, 2021, 9 (17)