Network-based topic structure visualization

被引:0
|
作者
Jeon, Yeseul [1 ,2 ]
Park, Jina [1 ,2 ]
Jin, Ick Hoon [1 ,2 ]
Chung, Dongjun [3 ]
机构
[1] Yonsei Univ, Dept Stat & Data Sci, 50 Yonsei Ro, Seoul 03722, South Korea
[2] Yonsei Univ, Dept Appl Stat, 50 Yonsei Ro, Seoul 03722, South Korea
[3] Ohio State Univ, Dept Biomed Informat, Lincoln Tower Room 250,1800 Cannon Dr, Columbus, OH 43210 USA
基金
新加坡国家研究基金会; 美国国家卫生研究院;
关键词
Latent space item response model; topic embedding; topic structure visualization; text mining; network analysis; MODEL;
D O I
10.1080/02664763.2024.2369953
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the real world, many topics are inter-correlated, making it challenging to investigate their structure and relationships. Understanding the interplay between topics and their relevance can provide valuable insights for researchers, guiding their studies and informing the direction of research. In this paper, we utilize the topic-words distribution, obtained from topic models, as item-response data to model the structure of topics using a latent space item response model. By estimating the latent positions of topics based on their distances toward words, we can capture the underlying topic structure and reveal their relationships. Visualizing the latent positions of topics in Euclidean space allows for an intuitive understanding of their proximity and associations. We interpret relationships among topics by characterizing each topic based on representative words selected using a newly proposed scoring scheme. Additionally, we assess the maturity of topics by tracking their latent positions using different word sets, providing insights into the robustness of topics. To demonstrate the effectiveness of our approach, we analyze the topic composition of COVID-19 studies during the early stage of its emergence using biomedical literature in the PubMed database. The software and data used in this paper are publicly available at https://github.com/jeon9677/gViz.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Siamese Network-Based Supervised Topic Modeling
    Huang, Minghui
    Rao, Yanghui
    Liu, Yuwei
    Xie, Haoran
    Wang, Fu Lee
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 4652 - 4662
  • [2] Robustness and Sensitivity of Network-Based Topic Detection
    Galluccio, Carla
    Magnani, Matteo
    Vega, Davide
    Ragozini, Giancarlo
    Petrucci, Alessandra
    [J]. COMPLEX NETWORKS AND THEIR APPLICATIONS XI, COMPLEX NETWORKS 2022, VOL 2, 2023, 1078 : 259 - 270
  • [3] CompositeView: A Network-Based Visualization Tool
    Allegri, Stephen A.
    McCoy, Kevin
    Mitchell, Cassie S.
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (02)
  • [4] Coauthorship network-based literature recommendation with topic model
    Hwang, San-Yih
    Wei, Chih-Ping
    Lee, Chien-Hsiang
    Chen, Yu-Siang
    [J]. ONLINE INFORMATION REVIEW, 2017, 41 (03) : 318 - 336
  • [5] Topic Modeling for Management Sciences: A Network-based Approach
    Menenberg, Max
    Pathak, Surva
    Udyapuram, Hari P.
    Gavirneni, Srinagesh
    Roychowdhury, Sohini
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3509 - 3518
  • [6] Visualization of topic distribution based on immune network model
    Takama, Y
    [J]. GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT I, PROCEEDINGS, 2003, 2723 : 246 - 247
  • [7] Analyzing the generalizability of the network-based topic emergence identification method
    Jung, Sukhwan
    Segev, Aviv
    [J]. SEMANTIC WEB, 2022, 13 (03) : 423 - 439
  • [8] Neural network-based visualization using clustered data
    Ivanikovas, Sergejus
    Dzemyda, Gintautas
    Medvedev, Viktor
    [J]. 20TH INTERNATIONAL CONFERENCE, EURO MINI CONFERENCE CONTINUOUS OPTIMIZATION AND KNOWLEDGE-BASED TECHNOLOGIES, EUROPT'2008, 2008, : 335 - 341
  • [9] NaviCell Web Service for network-based data visualization
    Bonnet, Eric
    Viara, Eric
    Kuperstein, Inna
    Calzone, Laurence
    Cohen, David P. A.
    Barillot, Emmanuel
    Zinovyev, Andrei
    [J]. NUCLEIC ACIDS RESEARCH, 2015, 43 (W1) : W560 - W565
  • [10] NETWORK-BASED STRUCTURE FLOW ESTIMATION
    Liu, Shu
    Barnes, Nick
    Mahony, Robert
    Ye, Haolei
    [J]. 2020 DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2020,