A Method for Analyzing Solution Diversity in Topic Models

被引:0
|
作者
Uchiyama, Toshio [1 ]
机构
[1] Hokkaido Informat Univ, Dept Syst & Informat, Ebetsu, Hokkaido, Japan
关键词
topic model; PLSA; diversity of solutions; normalized mutual information; information-theoretic clustering;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A topic model is a statistical model for modeling high dimensional count data. Many different parameters (solutions) of a topic model can be obtained through a learning algorithm due to different initial conditions. This paper focuses on diversity of solutions. To utilize diversity of solutions, it is necessary to acquire distribution structure of them. Therefore, this paper proposes a novel method to define similarity (inner product) of solutions using normalized mutual information to analyze distribution of solutions. Experimental results for text data are presented to show the usefulness of the proposed method.
引用
收藏
页码:29 / 34
页数:6
相关论文
共 50 条
  • [1] Analyzing Bayesian Crosslingual Transfer in Topic Models
    Hao, Shudong
    Paul, Michael J.
    2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 1551 - 1565
  • [2] Analyzing user reviews in tourism with topic models
    Rossetti, Marco
    Stella, Fabio
    Zanker, Markus
    INFORMATION TECHNOLOGY & TOURISM, 2016, 16 (01) : 5 - 21
  • [3] Analyzing the history of Cognition using Topic Models
    Priva, Uriel Cohen
    Austerweil, Joseph L.
    COGNITION, 2015, 135 : 4 - 9
  • [4] Analyzing temporal patterns of topic diversity using graph clustering
    Takako Hashimoto
    David Lawrence Shepard
    Tetsuji Kuboyama
    Kilho Shin
    Ryota Kobayashi
    Takeaki Uno
    The Journal of Supercomputing, 2021, 77 : 4375 - 4388
  • [5] Analyzing temporal patterns of topic diversity using graph clustering
    Hashimoto, Takako
    Shepard, David Lawrence
    Kuboyama, Tetsuji
    Shin, Kilho
    Kobayashi, Ryota
    Uno, Takeaki
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (05): : 4375 - 4388
  • [6] Diagnosing and Improving Topic Models by Analyzing Posterior Variability
    Xing, Linzi
    Paul, Michael J.
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 6005 - 6012
  • [7] Analyzing Topic Models: A Tourism Recommender System Perspective
    Kamal, Maryam
    Romani, Gianfranco
    Ricciuti, Giuseppe
    Anagnostopoulos, Aris
    Chatzigiannakis, Ioannis
    ADVANCED INFORMATION NETWORKING AND APPLICATIONS, VOL 2, AINA 2024, 2024, 200 : 250 - 262
  • [8] Analyzing #LasTesis Feminist Movement in Twitter Using Topic Models
    Rodriguez, Sebastian
    Allende-Cid, Hector
    Gonzalez, Cristian
    Alfaro, Rodrigo
    Elortegui, Claudio
    Palma, Wenceslao
    Santander, Pedro
    SOCIAL COMPUTING AND SOCIAL MEDIA. DESIGN, ETHICS, USER BEHAVIOR, AND SOCIAL NETWORK ANALYSIS, SCSM 2020, PT I, 2020, 12194 : 624 - 635
  • [9] Analyzing entities and topics in news articles using statistical topic models
    Newman, David
    Chemudugunta, Chaitanya
    Smyth, Padhraic
    Steyvers, Mark
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2006, 3975 : 93 - 104
  • [10] PGeoTopic: A Distributed Solution for Mining Geographical Topic Models
    Zhao, Kaiqi
    Cong, Gao
    Li, Xiucheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (02) : 881 - 893