Semantic Concept Spaces: Guided Topic Model Refinement using Word-Embedding Projections

被引:27
|
作者
El-Assady, Mennatallah [1 ,2 ]
Kehlbeck, Rebecca [1 ]
Collins, Christopher [2 ]
Keim, Daniel [1 ]
Deussen, Oliver [1 ]
机构
[1] Univ Konstanz, Constance, Germany
[2] Ontario Tech Univ, Oshawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Topic Model Optimization; Word Embedding; Mixed-Initiative Refinement; Guided Visual Analytics; Semantic Mapping; VISUAL ANALYTICS;
D O I
10.1109/TVCG.2019.2934654
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a framework that allows users to incorporate the semantics of their domain knowledge for topic model refinement while remaining model-agnostic. Our approach enables users to (1) understand the semantic space of the model, (2) identify regions of potential conflicts and problems, and (3) readjust the semantic relation of concepts based on their understanding, directly influencing the topic modeling. These tasks are supported by an interactive visual analytics workspace that uses word-embedding projections to define concept regions which can then be refined. The user-refined concepts are independent of a particular document collection and can be transferred to related corpora. All user interactions within the concept space directly affect the semantic relations of the underlying vector space model, which, in turn, change the topic modeling. In addition to direct manipulation, our system guides the users decision-making process through recommended interactions that point out potential improvements. This targeted refinement aims at minimizing the feedback required for an efficient human-in-the-loop process. We confirm the improvements achieved through our approach in two user studies that show topic model quality improvements through our visual knowledge externalization and learning process.
引用
收藏
页码:1001 / 1011
页数:11
相关论文
共 50 条
  • [41] A novel model for semantic similarity measurement based on wordnet and word embedding
    Zhao, Fuqiang
    Zhu, Zhengyu
    Han, Ping
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (05) : 9831 - 9842
  • [42] Topic segmentation using word-level semantic relatedness functions
    Ercan, Gonenc
    Cicekli, Ilyas
    JOURNAL OF INFORMATION SCIENCE, 2016, 42 (05) : 597 - 608
  • [43] Semantic Sparse Service Discovery Using Word Embedding and Gaussian LDA
    Tian, Gang
    Zhao, Shengtao
    Wang, Jian
    Zhao, Ziqi
    Liu, Junju
    Guo, Lantian
    IEEE ACCESS, 2019, 7 : 88231 - 88242
  • [44] Semantic concept model using Wikipedia semantic features
    Saif, Abdulgabbar
    Omar, Nazlia
    Ab Aziz, Mohd Juzaiddin
    Zainodin, Ummi Zakiah
    Salim, Naomie
    JOURNAL OF INFORMATION SCIENCE, 2018, 44 (04) : 526 - 551
  • [45] GLTM: A Global and Local Word Embedding-Based Topic Model for Short Texts
    Liang, Wenxin
    Feng, Ran
    Liu, Xinyue
    Li, Yuangang
    Zhang, Xianchao
    IEEE ACCESS, 2018, 6 : 43612 - 43621
  • [46] Word Sense Induction using Correlated Topic Model
    Thanh Tung Hoang
    Phuong Thai Nguyen
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 41 - 44
  • [47] Word Sense Disambiguation using Author Topic Model
    Kaneishi, Shougo
    Tajima, Takuya
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INDEPENDENT COMPUTING (ISIC), 2014, : 78 - 83
  • [48] Vector Representation of Bengali Word Using Various Word Embedding Model
    Rafat, Ashik Ahamed Aman
    Salehin, Mushfiqus
    Khan, Fazle Rabby
    Hossain, Syed Akhter
    Abujar, Sheikh
    PROCEEDINGS OF THE 2019 8TH INTERNATIONAL CONFERENCE ON SYSTEM MODELING & ADVANCEMENT IN RESEARCH TRENDS (SMART-2019), 2019, : 27 - 30
  • [49] Comparisons of the City Brand Influence of Global Cities: Word-Embedding Based Semantic Mining and Clustering Analysis on the Big Data of GDELT Global News Knowledge Graph
    Zheng, Chenyu
    SUSTAINABILITY, 2020, 12 (16)
  • [50] Semantic-enhanced topic evolution analysis: a combination of the dynamic topic model and word2vec
    Gao, Qiang
    Huang, Xiao
    Dong, Ke
    Liang, Zhentao
    Wu, Jiang
    SCIENTOMETRICS, 2022, 127 (03) : 1543 - 1563