Semantic Concept Spaces: Guided Topic Model Refinement using Word-Embedding Projections

被引:27
|
作者
El-Assady, Mennatallah [1 ,2 ]
Kehlbeck, Rebecca [1 ]
Collins, Christopher [2 ]
Keim, Daniel [1 ]
Deussen, Oliver [1 ]
机构
[1] Univ Konstanz, Constance, Germany
[2] Ontario Tech Univ, Oshawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Topic Model Optimization; Word Embedding; Mixed-Initiative Refinement; Guided Visual Analytics; Semantic Mapping; VISUAL ANALYTICS;
D O I
10.1109/TVCG.2019.2934654
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a framework that allows users to incorporate the semantics of their domain knowledge for topic model refinement while remaining model-agnostic. Our approach enables users to (1) understand the semantic space of the model, (2) identify regions of potential conflicts and problems, and (3) readjust the semantic relation of concepts based on their understanding, directly influencing the topic modeling. These tasks are supported by an interactive visual analytics workspace that uses word-embedding projections to define concept regions which can then be refined. The user-refined concepts are independent of a particular document collection and can be transferred to related corpora. All user interactions within the concept space directly affect the semantic relations of the underlying vector space model, which, in turn, change the topic modeling. In addition to direct manipulation, our system guides the users decision-making process through recommended interactions that point out potential improvements. This targeted refinement aims at minimizing the feedback required for an efficient human-in-the-loop process. We confirm the improvements achieved through our approach in two user studies that show topic model quality improvements through our visual knowledge externalization and learning process.
引用
收藏
页码:1001 / 1011
页数:11
相关论文
共 50 条
  • [1] Prediction of Semantically Correct Bangla Words Using Stupid Backoff and Word-Embedding Model
    Mittra, Tanni
    Islam, Linta
    Roy, Deepak Chandra
    2019 2ND INTERNATIONAL CONFERENCE ON APPLIED INFORMATION TECHNOLOGY AND INNOVATION (ICAITI2019), 2019, : 66 - 70
  • [2] Combine Topic Modeling with Semantic Embedding: Embedding Enhanced Topic Model
    Zhang, Peng
    Wang, Suge
    Li, Deyu
    Li, Xiaoli
    Xu, Zhikang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2020, 32 (12) : 2322 - 2335
  • [3] A Layered Approach to Automatic Essay Evaluation Using Word-Embedding
    Tashu, Tsegaye Misikir
    Horvath, Tomas
    COMPUTER SUPPORTED EDUCATION, 2019, 1022 : 77 - 94
  • [4] A Word Embedding Model For Topic Recommendation
    Kannan, Megala S.
    Mahalakshmi, G. S.
    Smitha, E. S.
    Sendhilkumar, S.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 826 - 830
  • [5] A Word Embedding Model For Topic Recommendation
    Kannan, Megala S.
    Mahalakshmi, G. S.
    Smitha, E. S.
    Sendhilkumar, S.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 1307 - 1311
  • [6] Word-Embedding Model for Evaluating Text Generation of Imbalanced Spam Reviews
    Purwitasari, Diana
    Zaqiyah, Ana Alimatus
    Fatichah, Chastine
    13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 21 - 26
  • [7] Software Sentiment Analysis Using Machine Learning with Different Word-Embedding
    Mula, Venkata Krishna Chandra
    Vijayvargiya, Sanidhya
    Kumar, Lov
    Samant, Surender Singh
    Murthy, Lalita Bhanu
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2022 WORKSHOPS, PART V, 2022, 13381 : 396 - 410
  • [8] A Latent Concept Topic Model for Robust Topic Inference Using Word Embeddings
    Hu, Weihua
    Tsujii, Jun'ichi
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 380 - 386
  • [9] Evolutions of semantic consistency in research topic via contextualized word embedding
    Huang, Shengzhi
    Lu, Wei
    Cheng, Qikai
    Luo, Zhuoran
    Huang, Yong
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (06)
  • [10] Short Text Embedding for Clustering based on Word and Topic Semantic Information
    Chen, Ziheng
    Ren, Jiangtao
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 61 - 70