Semantically-Enhanced Topic Modeling

被引:10
|
作者
Viegas, Felipe [1 ]
Luiz, Washington [1 ]
Gomes, Christian [2 ]
Khatibi, Amir [1 ]
Canuto, Sergio [3 ]
Mourao, Fernando [4 ]
Salles, Thiago [1 ]
Rocha, Leonardo [2 ]
Goncalves, Marcos Andre [1 ]
机构
[1] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
[2] Univ Fed Sao Joao del Rei, Sao Joao del Rei, Brazil
[3] IFG, Luziania, Brazil
[4] Seek AI Labs, Belo Horizonte, MG, Brazil
关键词
Topic Modeling; Word Embeddings; Bag of Words;
D O I
10.1145/3269206.3271797
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we advance the state-of-the-art in topic modeling by means of the design and development of a novel (semi-formal) general topic modeling framework. The novel contributions of our solution include: (i) the introduction of new semantically-enhanced data representations for topic modeling based on pooling, and (ii) the proposal of a novel topic extraction strategy - ASToC -that solves the difficulty in representing topics in our semantically-enhanced information space. In our extensive experimentation evaluation, covering 12 datasets and 12 state-of-the-art baselines, totalizing 108 tests, we exceed (with a few ties) in almost 100 cases, with gains of more than 50% against the best baselines (achieving up to 80% against some runner-ups). We provide qualitative and quantitative statistical analyses of why our solutions work so well. Finally, we show that our method is able to improve document representation in automatic text classification.
引用
收藏
页码:893 / 902
页数:10
相关论文
共 50 条
  • [1] Semantically-Enhanced Ubiquitous User Modeling
    Plumbaum, Till
    [J]. USER MODELING, ADAPTATION, AND PERSONALIZATION, PROCEEDINGS, 2010, 6075 : 407 - 410
  • [2] Semantically-enhanced topic recommendation systems for software projects
    Izadi, Maliheh
    Nejati, Mahtab
    Heydarnoori, Abbas
    [J]. EMPIRICAL SOFTWARE ENGINEERING, 2023, 28 (02)
  • [3] Semantically-enhanced topic recommendation systems for software projects
    Maliheh Izadi
    Mahtab Nejati
    Abbas Heydarnoori
    [J]. Empirical Software Engineering, 2023, 28
  • [4] Semantically-Enhanced Recommenders
    Codina, Victor
    Ceccaroni, Luigi
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2012, 248 : 69 - 78
  • [5] Verification of Semantically-Enhanced Artifact Systems
    Hariri, Babak Bagheri
    Calvanese, Diego
    Montali, Marco
    Santoso, Ario
    Solomakhin, Dmitry
    [J]. SERVICE-ORIENTED COMPUTING, ICSOC 2013, 2013, 8274 : 600 - 607
  • [6] SePeRe: Semantically-Enhanced System for Pest Recognition
    Garceran-Saez, Jesus
    Garcia-Sanchez, Francisco
    [J]. ICT FOR AGRICULTURE AND ENVIRONMENT, 2019, 901 : 3 - 11
  • [7] A Semantically-Enhanced Modelling Environment for Business Process as a Service
    Hinkelmann, Knut
    Kurjakovic, Sabrina
    Lammel, Benjamin
    Laurenzi, Emanuele
    Woitsch, Robert
    [J]. 2016 4TH INTERNATIONAL CONFERENCE ON ENTERPRISE SYSTEMS (ES) PROCEEDINGS, 2016, : 143 - 152
  • [8] Semantically-enhanced extension of the discussion analysis algorithm in SAKE
    Lukac, G.
    Butka, P.
    Mach, M.
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS, 2008, : 224 - +
  • [9] Sequential visual place recognition using semantically-enhanced features
    Paturkar, Varun
    Yadav, Rohit
    Kala, Rahul
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (17) : 50477 - 50491
  • [10] Semantically-enhanced information retrieval using multiple knowledge sources
    Jiang, Yuncheng
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (04): : 2925 - 2944