A deep learning model to identify gene expression level using cobinding transcription factor signals

被引:23
|
作者
Zhang, Lirong [1 ]
Yang, Yanchao [2 ]
Chai, Lu [2 ]
Li, Qianzhong [1 ]
Liu, Junjie [1 ]
Lin, Hao [3 ]
Liu, Li [1 ]
机构
[1] Inner Mongolia Univ, Lab Theoret Biophys, Hohhot, Peoples R China
[2] Inner Mongolia Univ, Sch Phys Sci & Technol, 23 West Univ Rd, Hohhot 010021, Peoples R China
[3] Inner Mongolia Univ, Ctr Informat Biol, Hohhot, Peoples R China
基金
中国国家自然科学基金;
关键词
gene expression; transcription factor; TF interaction networks; convolutional neural network; FACTOR-BINDING; CHIP-SEQ; COLOCALIZATION; DNA;
D O I
10.1093/bib/bbab501
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene expression is directly controlled by transcription factors (TFs) in a complex combination manner. It remains a challenging task to systematically infer how the cooperative binding of TFs drives gene activity. Here, we quantitatively analyzed the correlation between TFs and surveyed the TF interaction networks associated with gene expression in GM12878 and K562 cell lines. We identified six TF modules associated with gene expression in each cell line. Furthermore, according to the enrichment characteristics of TFs in these TF modules around a target gene, a convolutional neural network model, called TFCNN, was constructed to identify gene expression level. Results showed that the TFCNN model achieved a good prediction performance for gene expression. The average of the area under receiver operating characteristics curve (AUC) can reach up to 0.975 and 0.976, respectively in GM12878 and K562 cell lines. By comparison, we found that the TFCNN model outperformed the prediction models based on SVM and LDA. This is due to the TFCNN model could better extract the combinatorial interaction among TFs. Further analysis indicated that the abundant binding of regulatory TFs dominates expression of target genes, while the cooperative interaction between TFs has a subtle regulatory effects. And gene expression could be regulated by different TF combinations in a nonlinear way. These results are helpful for deciphering the mechanism of TF combination regulating gene expression.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Predicting gene expression level by the transcription factor binding signals in human embryonic stem cells
    Zhang, Lu-Qiang
    Li, Qian-Zhong
    Su, Wen-Xia
    Jin, Wen
    BIOSYSTEMS, 2016, 150 : 92 - 98
  • [2] A Predictive Model of Gene Expression Using a Deep Learning Framework
    Xie, Rui
    Quitadamo, Andrew
    Cheng, Jianlin
    Shi, Xinghua
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 676 - 681
  • [3] The Transcription Factor Titration Effect Dictates Level of Gene Expression
    Weinert, Franz M.
    Brewster, Robert C.
    Garcia, Hernan G.
    Song, Linda
    Rydenfelt, Mattias
    Phillips, Rob
    BIOPHYSICAL JOURNAL, 2014, 106 (02) : 489A - 489A
  • [4] The Transcription Factor Titration Effect Dictates Level of Gene Expression
    Brewster, Robert C.
    Weinert, Franz M.
    Garcia, Hernan G.
    Song, Dan
    Rydenfelt, Mattias
    Phillips, Rob
    CELL, 2014, 156 (06) : 1312 - 1323
  • [5] A quantitative model of transcription factor–activated gene expression
    Harold D Kim
    Erin K O'Shea
    Nature Structural & Molecular Biology, 2008, 15 : 1192 - 1198
  • [6] A quantitative model of transcription factor-activated gene expression
    Kim, Harold D.
    O'Shea, Erin K.
    NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2008, 15 (11) : 1192 - 1198
  • [7] Stochastic model of transcription factor-regulated gene expression
    Karmakar, Rajesh
    Bose, Indrani
    PHYSICAL BIOLOGY, 2006, 3 (03) : 200 - 208
  • [8] Base-resolution prediction of transcription factor binding signals by a deep learning framework
    Zhang, Qinhu
    He, Ying
    Wang, Siguo
    Chen, Zhanheng
    Guo, Zhenhao
    Cui, Zhen
    Liu, Qi
    Huang, De-Shuang
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (03)
  • [9] A skellam model to identify differential patterns of gene expression induced by environmental signals
    Jiang, Libo
    Mao, Ke
    Wu, Rongling
    BMC GENOMICS, 2014, 15
  • [10] DeepTFBS: A Hybrid Model Using Deep Learning Methods for Transcription Factor Binding Sites Prediction
    Hatipoglu, Aysegul
    Altuntas, Volkan
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2024,