Computational prediction and characterization of cell-type-specific and shared binding sites

被引:7
|
作者
Zhang, Qinhu [1 ,2 ]
Teng, Pengrui [3 ]
Wang, Siguo [4 ]
He, Ying [4 ]
Cui, Zhen [4 ]
Guo, Zhenghao [4 ]
Liu, Yixin [5 ]
Yuan, Changan [6 ]
Liu, Qi [1 ,2 ]
Huang, De-Shuang [7 ]
机构
[1] Tongji Univ, Translat Med Ctr Stem Cell Therapy, Shanghai 200092, Peoples R China
[2] Tongji Univ, Shanghai East Hosp, Inst Regenerat Med, Sch Life Sci & Technol,Bioinformat Dept, Shanghai 200092, Peoples R China
[3] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Peoples R China
[4] Tongji Univ, Inst Machine Learning & Syst Biol, Sch Elect & Informat Engn, Shanghai 201804, Peoples R China
[5] Univ Shanghai Sci & Technol, Sch Hlth Sci & Engn, Shanghai 200093, Peoples R China
[6] Guangxi Acad Sci, Big Data & Intelligent Comp Res Ctr, Nanning 530007, Peoples R China
[7] EIT Inst Adv Study, Ningbo 315201, Zhejiang, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
CHIP-SEQ; DNA; SEQUENCE; REVEALS;
D O I
10.1093/bioinformatics/btac798
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Cell-type-specific gene expression is maintained in large part by transcription factors (TFs) selectively binding to distinct sets of sites in different cell types. Recent research works have provided evidence that such cell-type-specific binding is determined by TF's intrinsic sequence preferences, cooperative interactions with co-factors, cell-type-specific chromatin landscapes and 3D chromatin interactions. However, computational prediction and characterization of cell-type-specific and shared binding sites is rarely studied. Results: In this article, we propose two computational approaches for predicting and characterizing cell-type-specific and shared binding sites by integrating multiple types of features, in which one is based on XGBoost and another is based on convolutional neural network (CNN). To validate the performance of our proposed approaches, ChIP-seq datasets of 10 binding factors were collected from the GM12878 (lymphoblastoid) and K562 (erythroleukemic) human hematopoietic cell lines, each of which was further categorized into cell-type-specific (GM12878- and K562-specific) and shared binding sites. Then, multiple types of features for these binding sites were integrated to train the XGBoost- and CNN-based models. Experimental results show that our proposed approaches significantly outperform other competing methods on three classification tasks. Moreover, we identified independent feature contributions for cell-type-specific and shared sites through SHAP values and explored the ability of the CNN-based model to predict cell-type-specific and shared binding sites by excluding or including DNase signals. Furthermore, we investigated the generalization ability of our proposed approaches to different binding factors in the same cellular environment.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Distinct Properties of Cell-Type-Specific and Shared Transcription Factor Binding Sites
    Gertz, Jason
    Savic, Daniel
    Varley, Katherine E.
    Partridge, E. Christopher
    Safi, Alexias
    Jain, Preti
    Cooper, Gregory M.
    Reddy, Timothy E.
    Crawford, Gregory E.
    Myers, Richard M.
    MOLECULAR CELL, 2013, 52 (01) : 25 - 36
  • [2] RNA-DamID reveals cell-type-specific binding of roX RNAs at chromatin-entry sites
    Cheetham, Seth W.
    Brand, Andrea H.
    NATURE STRUCTURAL & MOLECULAR BIOLOGY, 2018, 25 (01) : 809 - +
  • [3] RNA-DamID reveals cell-type-specific binding of roX RNAs at chromatin-entry sites
    Seth W. Cheetham
    Andrea H. Brand
    Nature Structural & Molecular Biology, 2018, 25 : 109 - 114
  • [4] γ-Synuclein: Cell-Type-Specific Promoter Activity and Binding to Transcription Factors
    Irina Surgucheva
    Andrei Surguchov
    Journal of Molecular Neuroscience, 2008, 35 : 267 - 271
  • [5] γ-synuclein:: Cell-type-specific promoter activity and binding to transcription factors
    Surgucheva, Irina
    Surguchov, Andrei
    JOURNAL OF MOLECULAR NEUROSCIENCE, 2008, 35 (03) : 267 - 271
  • [6] Sequence and chromatin determinants of cell-type-specific transcription factor binding
    Arvey, Aaron
    Agius, Phaedra
    Noble, William Stafford
    Leslie, Christina
    GENOME RESEARCH, 2012, 22 (09) : 1723 - 1734
  • [7] Cell-type-specific metabolism in plants
    Daloso, Danilo de Menezes
    Morais, Eva Gomes
    Oliveira e Silva, Karen Fernanda
    Williams, Thomas Christopher Rhys
    PLANT JOURNAL, 2023, 114 (05): : 1093 - 1114
  • [8] CELL-TYPE-SPECIFIC TRANSCRIPTION IN YEAST
    DOLAN, JW
    FIELDS, S
    BIOCHIMICA ET BIOPHYSICA ACTA, 1991, 1088 (02) : 155 - 169
  • [9] Cell-Type-Specific Optogenetics in Monkeys
    Namboodiri, Vijay Mohan K.
    Stuber, Garret D.
    CELL, 2016, 166 (06) : 1366 - 1368
  • [10] Cell-Type-Specific Neuroproteomics of Synapses
    Yim, Yun Young
    Nestler, Eric J.
    BIOMOLECULES, 2023, 13 (06)