Can Topic Modelling benefit from Word Sense Information?

Cited by: 0
Authors
Ferrugento, Adriana [1]
Oliveira, Hugo Goncalo [1]
Alves, Ana Oliveira [1, 2]
Rodrigues, Filipe [1]
Affiliations
[1] Univ Coimbra, Dept Informat Engn, CISUC, Coimbra, Portugal
[2] Polytech Inst Coimbra, IPC, Coimbra, Portugal
Keywords
topic model; word senses; WordNet; semantics; SemLDA
DOI
Not available
Chinese Library Classification number
H [Language and Writing]
Discipline classification code
05
Abstract
This paper proposes a new topic model that exploits word sense information in order to discover less redundant and more informative topics. Word sense information is obtained from WordNet and the discovered topics are groups of synsets, instead of mere surface words. A key feature is that all the known senses of a word are considered, with their probabilities. Alternative configurations of the model are described and compared to each other and to LDA, the most popular topic model. However, the obtained results suggest that there are no benefits of enriching LDA with word sense information.
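As a rough illustration of the idea that every known sense of a word is considered together with its probability, the sketch below spreads each token's count over its candidate synsets. It is not the model described in the paper: the `SENSES` inventory, its synset identifiers and probabilities, and the `expected_synset_counts` helper are all hypothetical and chosen only for this example.

```python
from collections import defaultdict

# Hypothetical sense inventory: each word maps to (synset_id, probability)
# pairs. The identifiers and probabilities are made up for illustration;
# they are not taken from WordNet or from the paper.
SENSES = {
    "bank": [("bank.n.01", 0.6), ("bank.n.02", 0.4)],
    "river": [("river.n.01", 1.0)],
    "money": [("money.n.01", 0.75), ("money.n.02", 0.25)],
}


def expected_synset_counts(tokens, senses=SENSES):
    """Spread each token's count over all of its known senses.

    Instead of picking one sense per word, every candidate synset
    receives a fractional count equal to its probability.
    Tokens missing from the inventory are skipped.
    """
    counts = defaultdict(float)
    for token in tokens:
        for synset_id, prob in senses.get(token, []):
            counts[synset_id] += prob
    return dict(counts)


doc = ["bank", "river", "bank", "money"]
counts = expected_synset_counts(doc)
```

A representation like `counts` could then stand in for the usual bag of surface words when fitting a topic model, so that topics are expressed over synsets rather than word forms.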
Pages: 3387-3393
Page count: 7