Prediction of regulatory networks: genome-wide identification of transcription factor targets from gene expression data

被引:89
|
作者
Qian, J
Lin, J
Luscombe, NM
Yu, HY
Gerstein, M
机构
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Johns Hopkins Med Sch, Dept Ophthalmol, Baltimore, MD 21287 USA
关键词
D O I
10.1093/bioinformatics/btg347
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Defining tegulatory networks, linking transcription factors (TFs) to their targets, is a central problem in post-genomic biology. One might imagine one could readily determine these networks through inspection of gene expression data. However, the relationship between the expression timecourse of a transcription factor and its target is not obvious (e.g. simple correlation over the timecourse), and current analysis methods, such as hierarchical clustering, have not been very successful in deciphering them. Results: Here we introduce an approach based on support vector machines (SVMs) to predict the targets of a transcription factor by identifying subtle relationships between their expression profiles. In particular, we used SVMs to predict the regulatory targets for 36 transcription factors in the Saccharomyces cerevisiae genome based on the microarray expression data from many different physiological conditions. We trained and tested our SVM on a data set constructed to include a significant number of both positive and negative examples, directly addressing data imbalance issues. This was non-trivial given that most of the known experimental information is only for positives. Overall, we found that 63% of our TF-target relationships were confirmed through cross-validation. We further assessed the performance of our regulatory network identifications by comparing them with the results from two recent genome-wide ChIP-chip experiments. Overall, we find the agreement between our results and these experiments is comparable to the agreement (albeit low) between the two experiments. We find that this network has a delocalized structure with respect to chromosomal positioning, with a given transcription factor having targets spread fairly uniformly across the genome.
引用
收藏
页码:1917 / 1926
页数:10
相关论文
共 50 条
  • [1] Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data
    Liu, Zhi-Ping
    [J]. CURRENT GENOMICS, 2015, 16 (01) : 3 - 22
  • [2] Genome-wide identification of the regulatory targets of a transcription factor using biochemical characterization and computational genomic analysis
    Jolly, ER
    Chin, CS
    Herskowitz, I
    Li, H
    [J]. BMC BIOINFORMATICS, 2005, 6 (1)
  • [3] Genome-wide identification of the regulatory targets of a transcription factor using biochemical characterization and computational genomic analysis
    Emmitt R Jolly
    Chen-Shan Chin
    Ira Herskowitz
    Hao Li
    [J]. BMC Bioinformatics, 6
  • [4] Genome-Wide Transcription Factor DNA Binding Sites and Gene Regulatory Networks in Clostridium thermocellum
    Hebdon, Skyler D.
    Gerritsen, Alida T.
    Chen, Yi-Pei
    Marcano, Joan G.
    Chou, Katherine J.
    [J]. FRONTIERS IN MICROBIOLOGY, 2021, 12
  • [5] Inference of gene regulatory networks from genome-wide knockout fitness data
    Wang, Liming
    Wang, Xiaodong
    Arkin, Adam P.
    Samoilov, Michael S.
    [J]. BIOINFORMATICS, 2013, 29 (03) : 338 - 346
  • [6] A compendium of genome-wide hematopoietic transcription factor maps supports the identification of gene regulatory control mechanisms
    Hannah, Rebecca
    Joshi, Anagha
    Wilson, Nicola K.
    Kinston, Sarah
    Goettgens, Berthold
    [J]. EXPERIMENTAL HEMATOLOGY, 2011, 39 (05) : 531 - 541
  • [7] Genome-Wide Identification, Characterization, and Expression Profiling of the Legume BZR Transcription Factor Gene Family
    Li, Yueying
    He, Liangliang
    Li, Jing
    Chen, Jianghua
    Liu, Changning
    [J]. FRONTIERS IN PLANT SCIENCE, 2018, 9
  • [8] Genome-Wide Identification of ARF Transcription Factor Gene Family and Their Expression Analysis in Sweet Potato
    Pratt, Isaac Seth
    Zhang, Baohong
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (17)
  • [9] Genome-Wide Identification and Expression Profiling of the BZR Transcription Factor Gene Family in Nicotiana benthamiana
    Chen, Xuwei
    Wu, Xinyang
    Qiu, Shiyou
    Zheng, Hongying
    Lu, Yuwen
    Peng, Jiejun
    Wu, Guanwei
    Chen, Jianping
    Rao, Shaofei
    Yan, Fei
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2021, 22 (19)
  • [10] Genome-Wide Identification and Expression Analysis of the PLATZ Transcription Factor in Tomato
    Zhang, Lifang
    Yang, Tao
    Wang, Zepeng
    Zhang, Fulin
    Li, Ning
    Jiang, Weijie
    [J]. PLANTS-BASEL, 2023, 12 (14):