Semantic Concept Spaces: Guided Topic Model Refinement using Word-Embedding Projections

被引:27
|
作者
El-Assady, Mennatallah [1 ,2 ]
Kehlbeck, Rebecca [1 ]
Collins, Christopher [2 ]
Keim, Daniel [1 ]
Deussen, Oliver [1 ]
机构
[1] Univ Konstanz, Constance, Germany
[2] Ontario Tech Univ, Oshawa, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Topic Model Optimization; Word Embedding; Mixed-Initiative Refinement; Guided Visual Analytics; Semantic Mapping; VISUAL ANALYTICS;
D O I
10.1109/TVCG.2019.2934654
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a framework that allows users to incorporate the semantics of their domain knowledge for topic model refinement while remaining model-agnostic. Our approach enables users to (1) understand the semantic space of the model, (2) identify regions of potential conflicts and problems, and (3) readjust the semantic relation of concepts based on their understanding, directly influencing the topic modeling. These tasks are supported by an interactive visual analytics workspace that uses word-embedding projections to define concept regions which can then be refined. The user-refined concepts are independent of a particular document collection and can be transferred to related corpora. All user interactions within the concept space directly affect the semantic relations of the underlying vector space model, which, in turn, change the topic modeling. In addition to direct manipulation, our system guides the users decision-making process through recommended interactions that point out potential improvements. This targeted refinement aims at minimizing the feedback required for an efficient human-in-the-loop process. We confirm the improvements achieved through our approach in two user studies that show topic model quality improvements through our visual knowledge externalization and learning process.
引用
收藏
页码:1001 / 1011
页数:11
相关论文
共 50 条
  • [21] WERECE: An Unsupervised Method for Educational Concept Extraction Based on Word Embedding Refinement
    Huang, Jingxiu
    Ding, Ruofei
    Wu, Xiaomin
    Chen, Shumin
    Zhang, Jiale
    Liu, Lixiang
    Zheng, Yunxiang
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [22] A Domain Expertise and Word-Embedding Geometric Projection Based Semantic Mining Framework for Measuring the Soft Power of Social Entities
    Zheng, Chenyu
    Fan, Hong
    Singh, Rohit
    Shi, Yuanyuan
    IEEE ACCESS, 2020, 8 (08): : 204597 - 204611
  • [23] ONLINE ADAPTATIVE ZERO-SHOT LEARNING SPOKEN LANGUAGE UNDERSTANDING USING WORD-EMBEDDING
    Ferreira, Emmanuel
    Jabaian, Bassam
    Lefevre, Fabrice
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5321 - 5325
  • [24] Feature Expansion using Word Embedding for Tweet Topic Classification
    Setiawan, Erwin B.
    Widyantoro, Dwi H.
    Surendro, Kridanto
    2016 10TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATION SYSTEMS SERVICES AND APPLICATIONS (TSSA), 2016,
  • [25] Understanding the semantic change of Hangeul using word embedding
    Sun, Hyunseok
    Lee, Yung-Seop
    Lim, Changwon
    KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (03) : 295 - 308
  • [26] Short Text Clustering based on Word Semantic Graph with Word Embedding Model
    Jinarat, Supakpong
    Manaskasemsak, Bundit
    Rungsawang, Arnon
    2018 JOINT 10TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 19TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2018, : 1427 - 1432
  • [27] An Effective Way of Word-level Language Identification for Code-mixed Facebook comments using Word-Embedding via Character-embedding
    Veena, P. V.
    Kumar, Anand M.
    Soman, K. P.
    2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 1552 - 1556
  • [28] Exploiting word embedding for heterogeneous topic model towards patent recommendation
    Chen, Jie
    Chen, Jialin
    Zhao, Shu
    Zhang, Yanping
    Tang, Jie
    SCIENTOMETRICS, 2020, 125 (03) : 2091 - 2108
  • [29] Exploiting word embedding for heterogeneous topic model towards patent recommendation
    Jie Chen
    Jialin Chen
    Shu Zhao
    Yanping Zhang
    Jie Tang
    Scientometrics, 2020, 125 : 2091 - 2108
  • [30] Sense-Based Topic Word Embedding Model for Item Recommendation
    Xiao, Ya
    Fan, Zhijie
    Tan, Chengxiang
    Xu, Qian
    Zhu, Wenye
    Cheng, Fujia
    IEEE ACCESS, 2019, 7 : 44748 - 44760