A deep learning model to identify gene expression level using cobinding transcription factor signals

被引:23
|
作者
Zhang, Lirong [1 ]
Yang, Yanchao [2 ]
Chai, Lu [2 ]
Li, Qianzhong [1 ]
Liu, Junjie [1 ]
Lin, Hao [3 ]
Liu, Li [1 ]
机构
[1] Inner Mongolia Univ, Lab Theoret Biophys, Hohhot, Peoples R China
[2] Inner Mongolia Univ, Sch Phys Sci & Technol, 23 West Univ Rd, Hohhot 010021, Peoples R China
[3] Inner Mongolia Univ, Ctr Informat Biol, Hohhot, Peoples R China
基金
中国国家自然科学基金;
关键词
gene expression; transcription factor; TF interaction networks; convolutional neural network; FACTOR-BINDING; CHIP-SEQ; COLOCALIZATION; DNA;
D O I
10.1093/bib/bbab501
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Gene expression is directly controlled by transcription factors (TFs) in a complex combination manner. It remains a challenging task to systematically infer how the cooperative binding of TFs drives gene activity. Here, we quantitatively analyzed the correlation between TFs and surveyed the TF interaction networks associated with gene expression in GM12878 and K562 cell lines. We identified six TF modules associated with gene expression in each cell line. Furthermore, according to the enrichment characteristics of TFs in these TF modules around a target gene, a convolutional neural network model, called TFCNN, was constructed to identify gene expression level. Results showed that the TFCNN model achieved a good prediction performance for gene expression. The average of the area under receiver operating characteristics curve (AUC) can reach up to 0.975 and 0.976, respectively in GM12878 and K562 cell lines. By comparison, we found that the TFCNN model outperformed the prediction models based on SVM and LDA. This is due to the TFCNN model could better extract the combinatorial interaction among TFs. Further analysis indicated that the abundant binding of regulatory TFs dominates expression of target genes, while the cooperative interaction between TFs has a subtle regulatory effects. And gene expression could be regulated by different TF combinations in a nonlinear way. These results are helpful for deciphering the mechanism of TF combination regulating gene expression.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Integration of wounding and osmotic stress signals determines the expression of the AtMYB102 transcription factor gene
    Denekamp, M
    Smeekens, SC
    PLANT PHYSIOLOGY, 2003, 132 (03) : 1415 - 1423
  • [32] Deep Learning to Identify Transcription Start Sites from CAGE Data
    Zheng, Hansi
    Li, Xiaoman
    Hu, Haiyan
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 168 - 172
  • [33] Discovering transcription factor regulatory targets using gene expression and binding data
    Maienschein-Cline, Mark
    Zhou, Jie
    White, Kevin P.
    Sciammas, Roger
    Dinner, Aaron R.
    BIOINFORMATICS, 2012, 28 (02) : 206 - 213
  • [34] Deep learning for inferring transcription factor binding sites
    Koo, Peter K.
    Ploenzke, Matt
    CURRENT OPINION IN SYSTEMS BIOLOGY, 2020, 19 : 16 - 23
  • [35] Predicting Transcription Factor Binding Sites with Deep Learning
    Ghosh, Nimisha
    Santoni, Daniele
    Saha, Indrajit
    Felici, Giovanni
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (09)
  • [36] cTAP: A Machine Learning Framework for Predicting Target Genes of a Transcription Factor using a Cohort of Gene Expression Data Sets
    Wang, Honglin
    Joshi, Pujan
    Hong, Seung-Hyun
    Maye, Peter F.
    Rowe, David W.
    Shin, Dong-Guk
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 164 - 167
  • [37] Gene expression inference with deep learning
    Chen, Yifei
    Li, Yi
    Narayan, Rajiv
    Subramanian, Aravind
    Xie, Xiaohui
    BIOINFORMATICS, 2016, 32 (12) : 1832 - 1839
  • [38] A Cardiac Deep Learning Model (CDLM) to Predict and Identify the Risk Factor of Congenital Heart Disease
    Pachiyannan, Prabu
    Alsulami, Musleh
    Alsadie, Deafallah
    Saudagar, Abdul Khader Jilani
    AlKhathami, Mohammed
    Poonia, Ramesh Chandra
    DIAGNOSTICS, 2023, 13 (13)
  • [39] Feasibility Study of Deep Learning Based Radiosensitivity Binary Classification Model Using Gene Expression Profiling
    Kim, E.
    Chung, Y.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [40] Identification of a non-canonical transcription factor binding site using deep learning
    Proft, Sebastian
    Leiz, Janna
    Opitz, Robert
    Jung, Minie
    Heinemann, Udo
    Seelow, Dominik
    Schmidt-Ott, Kai
    Rutkiewicz, Maria
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 620 - 621