Hierarchical Theme and Topic Modeling

被引:22
|
作者
Chien, Jen-Tzung [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Comp Engn, Hsinchu 30010, Taiwan
关键词
Bayesian nonparametrics (BNPs); document summarization; structural learning; topic model; POISSON-DIRICHLET; LATENT;
D O I
10.1109/TNNLS.2015.2414658
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Considering the hierarchical data groupings in text corpus, e.g., words, sentences, and documents, we conduct the structural learning and infer the latent themes and topics for sentences and words from a collection of documents, respectively. The relation between themes and topics under different data groupings is explored through an unsupervised procedure without limiting the number of clusters. A tree stick-breaking process is presented to draw theme proportions for different sentences. We build a hierarchical theme and topic model, which flexibly represents the heterogeneous documents using Bayesian nonparametrics. Thematic sentences and topical words are extracted. In the experiments, the proposed method is evaluated to be effective to build semantic tree structure for sentences and the corresponding words. The superiority of using tree model for selection of expressive sentences for document summarization is illustrated.
引用
收藏
页码:565 / 578
页数:14
相关论文
共 50 条
  • [31] Hierarchical Summarization of Text Documents Using Topic Modeling and Formal Concept Analysis
    Akhtar, Nadeem
    Javed, Hira
    Ahmad, Tameem
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2018, VOL 2, 2019, 839 : 21 - 33
  • [32] Lifelong Hierarchical Topic Modeling via Non-negative Matrix Factorization
    Lin, Zhicheng
    Yan, Jiaxing
    Lei, Zhiqi
    Rao, Yanghui
    WEB AND BIG DATA, PT IV, APWEB-WAIM 2023, 2024, 14334 : 155 - 170
  • [33] Nonlinear Structural Equation Model Guided Gaussian Mixture Hierarchical Topic Modeling
    Chen, Hegang
    Mao, Pengbo
    Lu, Yuyin
    Rao, Yanghui
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 10377 - 10390
  • [34] Toward theme development analysis with topic clustering
    Geng, Xueyu
    Wang, Jinlong
    2008 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER THEORY AND ENGINEERING, 2008, : 628 - +
  • [35] Special Topic Issue on the Theme of Aromaticity Foreword
    Bull, James R.
    PURE AND APPLIED CHEMISTRY, 2010, 82 (04) : III - III
  • [36] Neural Topic Models for Hierarchical Topic Detection and Visualization
    Pham, Dang
    Le, Than M., V
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2021: RESEARCH TRACK, PT III, 2021, 12977 : 35 - 51
  • [37] Theme topic: Round table about bioavailability
    Tassy, A
    JOURNAL FRANCAIS D OPHTALMOLOGIE, 2000, 23 (05): : 499 - 499
  • [38] Theme topic: Round table "blepharoplasty" - Foreword
    Morax, S
    JOURNAL FRANCAIS D OPHTALMOLOGIE, 2004, 27 (06): : 634 - 634
  • [39] MAIN THEME, THE INTRAMEDULLARY PIN - INTRODUCTION TO THE TOPIC
    REHN, J
    UNFALLCHIRURG, 1990, 93 (11): : 487 - 487
  • [40] A Topic Model for Hierarchical Documents
    Yang, Yang
    Wang, Feifei
    Jiang, Fei
    Jin, Shuyuan
    Xu, Jin
    2016 IEEE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE IN CYBERSPACE (DSC 2016), 2016, : 118 - 126