Principal component analysis for predicting transcription-factor binding motifs from array-derived data

被引:8
|
作者
Liu, YL
Vincenti, MP
Yokota, H [1 ]
机构
[1] Indiana Univ Purdue Univ, Dept Biomed Engn, Indianapolis, IN 46202 USA
[2] Purdue Univ, Weldon Sch Biomed Engn, W Lafayette, IN 47907 USA
[3] Indiana Univ Purdue Univ, Dept Anat & Cell Biol, Indianapolis, IN 46202 USA
[4] Dept Vet Affairs, White River Jct, VT 05009 USA
[5] Dartmouth Coll Sch Med, Dept Med, Hanover, NH 03755 USA
关键词
D O I
10.1186/1471-2105-6-276
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The responses to interleukin 1 (IL-1) in human chondrocytes constitute a complex regulatory mechanism, where multiple transcription factors interact combinatorially to transcription-factor binding motifs (TFBMs). In order to select a critical set of TFBMs from genomic DNA information and an array-derived data, an efficient algorithm to solve a combinatorial optimization problem is required. Although computational approaches based on evolutionary algorithms are commonly employed, an analytical algorithm would be useful to predict TFBMs at nearly no computational cost and evaluate varying modelling conditions. Singular value decomposition (SVD) is a powerful method to derive primary components of a given matrix. Applying SVD to a promoter matrix defined from regulatory DNA sequences, we derived a novel method to predict the critical set of TFBMs. Results: The promoter matrix was defined to establish a quantitative relationship between the IL-1-driven mRNA alteration and genomic DNA sequences of the IL-1 responsive genes. The matrix was decomposed with SVD, and the effects of 8 potential TFBMs (5'-CAGGC-3', 5'-CGCCC-3', 5'-CCGCC- 3', 5'-ATGGG-3', 5'-GGGAA-3', 5'-CGTCC-3', 5'-AAAGG-3', and 5'-ACCCA-3') were predicted from a pool of 512 random DNA sequences. The prediction included matches to the core binding motifs of biologically known TFBMs such as AP2, SP1, EGR1, KROX, GC- BOX, ABI4, ETF, E2F, SRF, STAT, IK-1, PPAR., STAF, ROAZ, and NF kappa B, and their significance was evaluated numerically using Monte Carlo simulation and genetic algorithm. Conclusion: The described SVD-based prediction is an analytical method to provide a set of potential TFBMs involved in transcriptional regulation. The results would be useful to evaluate analytically a contribution of individual DNA sequences.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] maxATAC: Genome-scale transcription-factor binding prediction from ATAC-seq with deep neural networks
    Cazares, Tareian A.
    Rizvi, Faiz
    Iyer, Balaji
    Chen, Xiaoting
    Kotliar, Michael C.
    Bejjani, Anthony
    Wayman, Joseph T.
    Donmez, Omer
    Wronowski, Benjamin R.
    Parameswaran, Sreeja M.
    Kottyan, Leah
    Barski, Artem M.
    Weirauch, Matthew
    Prasath, V. B. Surya M.
    Miraldi, Emily
    PLOS COMPUTATIONAL BIOLOGY, 2023, 19 (01)
  • [42] Assessment of Algorithms for Inferring Positional Weight Matrix Motifs of Transcription Factor Binding Sites Using Protein Binding Microarray Data
    Orenstein, Yaron
    Linhart, Chaim
    Shamir, Ron
    PLOS ONE, 2012, 7 (09):
  • [43] COMPUTER-AIDED ANALYSIS OF POTENTIAL TRANSCRIPTION-FACTOR BINDING-SITES IN THE RABBIT BETA-CASEIN GENE PROMOTER
    MALEWSKI, T
    ZWIERZCHOWSKI, L
    BIOSYSTEMS, 1995, 36 (02) : 109 - 119
  • [44] USE OF PRINCIPAL COMPONENT ANALYSIS ON DATA FROM CHEMICAL ANALYSIS OF TEA LEAVES
    WILLSON, KC
    FREEMAN, GH
    EXPERIMENTAL AGRICULTURE, 1970, 6 (04) : 319 - +
  • [45] NMR ANALYSIS OF THE DNA-BINDING DOMAIN DERIVED FROM THE TRANSCRIPTION FACTOR PHO4
    KREMER, W
    KING, DS
    WEMMER, DE
    JOURNAL OF CELLULAR BIOCHEMISTRY, 1995, : 60 - 60
  • [46] Understanding transcriptional regulation by integrative analysis of transcription factor binding data
    Cheng, Chao
    Alexander, Roger
    Min, Renqiang
    Leng, Jing
    Yip, Kevin Y.
    Rozowsky, Joel
    Yan, Koon-Kiu
    Dong, Xianjun
    Djebali, Sarah
    Ruan, Yijun
    Davis, Carrie A.
    Carninci, Piero
    Lassman, Timo
    Gingerasi, Thomas R.
    Guigo, Roderic
    Birney, Ewan
    Weng, Zhiping
    Snyder, Michael
    Gerstein, Mark
    GENOME RESEARCH, 2012, 22 (09) : 1658 - 1667
  • [47] DNA Motifs and Transcription Factor Binding Sites Analysis of Regulatory Regions of MicroRNAs Relating to Psoriasis Disease
    Wang, Lina
    Jing, Yuxiao
    Du, Jianqiang
    Wu, Xiaoming
    Guo, Jiaqi
    Zheng, Yan
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING TECHNOLOGY (CSET2015), MEDICAL SCIENCE AND BIOLOGICAL ENGINEERING (MSBE2015), 2016, : 322 - 325
  • [48] Analysis of the association between transcription factor binding site variants and distinct accompanying regulatory motifs in yeast
    Chiang, Sufeng
    Swamy, Krishna B. S.
    Hsu, Ting-Wei
    Tsai, Zing Tsung-Yeh
    Lu, Henry Horng-Shing
    Wang, Daryi
    Tsai, Huai-Kuang
    GENE, 2012, 491 (02) : 237 - 245
  • [49] BiSAn: A software for efficient computation of transcription factor binding motifs for high throughput gene expression analysis
    Khan, Mohsin Amir Faiz
    Gorle, Chandrasekhar Babu
    Wang, Ping
    Liu, XiaoHui
    Li, Su-Ling
    2009 3RD INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING, VOLS 1-11, 2009, : 631 - +
  • [50] THE APPLICATION OF PRINCIPAL COMPONENT AND FACTOR-ANALYSIS PROCEDURES TO DATA FOR ELEMENT CONCENTRATIONS IN AEROSOLS FROM A REMOTE REGION
    VANESPEN, P
    ADAMS, F
    ANALYTICA CHIMICA ACTA, 1983, 150 (01) : 153 - 161