A Method for Analyzing Solution Diversity in Topic Models

被引:0
|
作者
Uchiyama, Toshio [1 ]
机构
[1] Hokkaido Informat Univ, Dept Syst & Informat, Ebetsu, Hokkaido, Japan
关键词
topic model; PLSA; diversity of solutions; normalized mutual information; information-theoretic clustering;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A topic model is a statistical model for modeling high dimensional count data. Many different parameters (solutions) of a topic model can be obtained through a learning algorithm due to different initial conditions. This paper focuses on diversity of solutions. To utilize diversity of solutions, it is necessary to acquire distribution structure of them. Therefore, this paper proposes a novel method to define similarity (inner product) of solutions using normalized mutual information to analyze distribution of solutions. Experimental results for text data are presented to show the usefulness of the proposed method.
引用
收藏
页码:29 / 34
页数:6
相关论文
共 50 条
  • [21] METHOD FOR ANALYZING MATHEMATICAL MODELS IN A FACTOR SPACE
    LAPIN, YN
    STRELTSO.AA
    INDUSTRIAL LABORATORY, 1969, 35 (03): : 405 - &
  • [22] A method for analyzing the vibrational energy flow in biomolecules in solution
    Angel Soler, Miguel
    Bastida, Adolfo
    Farag, Marwa H.
    Zuniga, Jose
    Requena, Alberto
    JOURNAL OF CHEMICAL PHYSICS, 2011, 135 (20):
  • [23] Topic Models with Topic Ordering Regularities for Topic Segmentation
    Du, Lan
    Pate, John K.
    Johnson, Mark
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 803 - 808
  • [24] Analyzing discourse topics and topic keywords
    Todd, Richard Watson
    SEMIOTICA, 2011, 184 (1-4) : 251 - 270
  • [25] Method of Moments for Topic Models with Mixed Discrete and Continuous Features
    Giesen, Joachim
    Kahlmeyer, Paul
    Laue, Soeren
    Mitterreiter, Matthias
    Nussbaum, Frank
    Staudt, Christoph
    Zarriess, Sina
    PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2418 - 2424
  • [26] A Flexible Stochastic Method for Solving the MAP Problem in Topic Models
    Tu Vu
    Xuan Bui
    Khoat Than
    Ichise, Ryutaro
    COMPUTACION Y SISTEMAS, 2018, 22 (04): : 1317 - 1327
  • [27] Compensation: an alternative method for analyzing diversity-productivity experiments
    Adler, PB
    Bradford, JB
    OIKOS, 2002, 96 (03) : 411 - 420
  • [28] A method of refining topic models based on term and document frequencies
    Higashi K.
    Takahashi H.
    Nakagawa H.
    Tsuchiya T.
    Computer Software, 2019, 36 (04) : 25 - 31
  • [29] CoTE: A Flexible Method for Joint Learning of Topic and Embedding Models
    Zhao, Bo
    Yuan, Chunfeng
    Huang, Yihua
    WEB AND BIG DATA, PT IV, APWEB-WAIM 2023, 2024, 14334 : 406 - 421
  • [30] CONSTRUCTION AND REFLECTION OF CONCEPTUAL MODELS AS A METHOD FOR ANALYZING EXPERTISE
    ETELAPELTO, A
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1992, 27 (3-4) : 573 - 573