Can Topic Modelling benefit from Word Sense Information?

被引:0
|
作者
Ferrugento, Adriana [1 ]
Oliveira, Hugo Goncalo [1 ]
Alves, Ana Oliveira [1 ,2 ]
Rodrigues, Filipe [1 ]
机构
[1] Univ Coimbra, Dept Informat Engn, CISUC, Coimbra, Portugal
[2] Polytech Inst Coimbra, IPC, Coimbra, Portugal
关键词
topic model; word senses; WordNet; semantics; SemLDA;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
This paper proposes a new topic model that exploits word sense information in order to discover less redundant and more informative topics. Word sense information is obtained from WordNet and the discovered topics are groups of synsets, instead of mere surface words. A key feature is that all the known senses of a word are considered, with their probabilities. Alternative configurations of the model are described and compared to each other and to LDA, the most popular topic model. However, the obtained results suggest that there are no benefits of enriching LDA with word sense information.
引用
收藏
页码:3387 / 3393
页数:7
相关论文
共 50 条
  • [1] A Word Sense Probabilistic Topic Model
    Jin, Peng
    Chen, Xingyuan
    [J]. 2013 9TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS), 2013, : 401 - 404
  • [2] Topic Modeling for Word Sense Induction
    Knopp, Johannes
    Voelker, Johanna
    Ponzetto, Simone Paolo
    [J]. LANGUAGE PROCESSING AND KNOWLEDGE IN THE WEB, 2013, 8105 : 97 - 103
  • [3] Statistical word sense aware topic models
    Tang, Guoyu
    Xia, Yunqing
    Sun, Jun
    Zhang, Min
    Zheng, Thomas Fang
    [J]. SOFT COMPUTING, 2015, 19 (01) : 13 - 27
  • [4] Statistical word sense aware topic models
    Guoyu Tang
    Yunqing Xia
    Jun Sun
    Min Zhang
    Thomas Fang Zheng
    [J]. Soft Computing, 2015, 19 : 13 - 27
  • [5] Topic Modeling and Word Sense Disambiguation on the Ancora corpus
    Izquierdo, Ruben
    Postma, Marten
    Vossen, Piek
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (55): : 15 - 22
  • [6] Word Sense Induction using Correlated Topic Model
    Thanh Tung Hoang
    Phuong Thai Nguyen
    [J]. 2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 41 - 44
  • [7] Word Sense Disambiguation using Author Topic Model
    Kaneishi, Shougo
    Tajima, Takuya
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON INDEPENDENT COMPUTING (ISIC), 2014, : 78 - 83
  • [8] Word Sense Disambiguation based on Sequence Topic Model using sense dependency
    Yang, Qi
    Li, Ruixuan
    Li, Yuhua
    Gu, Xiwu
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [9] Word sense disambiguation for Information Retrieval
    Uzuner, O
    Katz, B
    Yuret, D
    [J]. SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-99)/ELEVENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE (IAAI-99), 1999, : 985 - 985
  • [10] Modelling of Topic from Hindi Corpus using Word2Vec
    Panigrahi, Sabitra Sankalp
    Panigrahi, Narayan
    Paul, Biswajit
    [J]. 2018 SECOND INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, CONTROL AND COMMUNICATION TECHNOLOGY (IAC3T), 2018, : 97 - 100