WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data

被引:0
|
作者
Hongbo Zhang
Lin Zhu
De-Shuang Huang
机构
[1] College of Electronics and Information Engineering,Institute of Machine Learning and Systems Biology
[2] Tongji University,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Although discriminative motif discovery (DMD) methods are promising for eliciting motifs from high-throughput experimental data, due to consideration of computational expense, most of existing DMD methods have to choose approximate schemes that greatly restrict the search space, leading to significant loss of predictive accuracy. In this paper, we propose Weakly-Supervised Motif Discovery (WSMD) to discover motifs from ChIP-seq datasets. In contrast to the learning strategies adopted by previous DMD methods, WSMD allows a “global” optimization scheme of the motif parameters in continuous space, thereby reducing the information loss of model representation and improving the quality of resultant motifs. Meanwhile, by exploiting the connection between DMD framework and existing weakly supervised learning (WSL) technologies, we also present highly scalable learning strategies for the proposed method. The experimental results on both real ChIP-seq datasets and synthetic datasets show that WSMD substantially outperforms former DMD methods (including DREME, HOMER, XXmotif, motifRG and DECOD) in terms of predictive accuracy, while also achieving a competitive computational speed.
引用
收藏
相关论文
共 50 条
  • [1] WSMD: weakly-supervised motif discovery in transcription factor ChIP-seq data
    Zhang, Hongbo
    Zhu, Lin
    Huang, De-Shuang
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [2] DREME: motif discovery in transcription factor ChIP-seq data
    Bailey, Timothy L.
    [J]. BIOINFORMATICS, 2011, 27 (12) : 1653 - 1659
  • [3] coMOTIF: a mixture framework for identifying transcription factor and a coregulator motif in ChIP-seq Data
    Xu, Mengyuan
    Weinberg, Clarice R.
    Umbach, David M.
    Li, Leping
    [J]. BIOINFORMATICS, 2011, 27 (19) : 2625 - 2632
  • [4] A Clustering Approach for Motif Discovery in ChIP-Seq Dataset
    Sun, Chun-xiao
    Yang, Yu
    Wang, Hua
    Wang, Wen-hu
    [J]. ENTROPY, 2019, 21 (08)
  • [5] Extracting transcription factor targets from ChIP-Seq data
    Tuteja, Geetu
    White, Peter
    Schug, Jonathan
    Kaestner, Klaus H.
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (17) : e113 - e113
  • [6] Inferring transcription factor complexes from ChIP-seq data
    Whitington, Tom
    Frith, Martin C.
    Johnson, James
    Bailey, Timothy L.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (15) : e98
  • [7] A review of ensemble methods for de novo motif discovery in ChIP-Seq data
    Lihu, Andrei
    Holban, Stefan
    [J]. BRIEFINGS IN BIOINFORMATICS, 2015, 16 (06) : 964 - 973
  • [8] Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment
    Hunt, Rebecca Worsley
    Mathelier, Anthony
    del Peso, Luis
    Wasserman, Wyeth W.
    [J]. BMC GENOMICS, 2014, 15
  • [9] Improving analysis of transcription factor binding sites within ChIP-Seq data based on topological motif enrichment
    Rebecca Worsley Hunt
    Anthony Mathelier
    Luis del Peso
    Wyeth W Wasserman
    [J]. BMC Genomics, 15
  • [10] Identifying differential transcription factor binding in ChIP-seq
    Wu, Dai-Ying
    Bittencourt, Danielle
    Stallcup, Michael R.
    Siegmund, Kimberly D.
    [J]. FRONTIERS IN GENETICS, 2015, 6