Can Topic Modelling benefit from Word Sense Information?

被引:0
|
作者
Ferrugento, Adriana [1 ]
Oliveira, Hugo Goncalo [1 ]
Alves, Ana Oliveira [1 ,2 ]
Rodrigues, Filipe [1 ]
机构
[1] Univ Coimbra, Dept Informat Engn, CISUC, Coimbra, Portugal
[2] Polytech Inst Coimbra, IPC, Coimbra, Portugal
关键词
topic model; word senses; WordNet; semantics; SemLDA;
D O I
暂无
中图分类号
H [语言、文字];
学科分类号
05 ;
摘要
This paper proposes a new topic model that exploits word sense information in order to discover less redundant and more informative topics. Word sense information is obtained from WordNet and the discovered topics are groups of synsets, instead of mere surface words. A key feature is that all the known senses of a word are considered, with their probabilities. Alternative configurations of the model are described and compared to each other and to LDA, the most popular topic model. However, the obtained results suggest that there are no benefits of enriching LDA with word sense information.
引用
收藏
页码:3387 / 3393
页数:7
相关论文
共 50 条
  • [31] Probabilistic Relational Supervised Topic Modelling using Word Embeddings
    Al-Ani, Jabir Alshehabi
    Fasli, Maria
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2035 - 2043
  • [32] Scoliosis Related Information on the Internet in China: Can Patients Benefit from This Information?
    Bao, Hongda
    Zhu, Feng
    Wang, Fei
    Liu, Zhen
    Bao, Mike H.
    He, Shouyu
    Zhu, Zezhang
    Qiu, Yong
    [J]. PLOS ONE, 2015, 10 (02):
  • [33] Nested variational autoencoder for topic modelling on microtexts with word vectors
    Trinh, Trung
    Quan, Tho
    Mai, Trung
    [J]. EXPERT SYSTEMS, 2021, 38 (02)
  • [34] Topic modelling of the Information Society Happy Birthday, Information Society!
    Fanni, Mate
    Eszter, Katona
    Arpad, Knap
    Mihaly, Csoto
    [J]. INFORMACIOS TARSADALOM, 2021, 21 (01): : 10 - 47
  • [35] Can Visual Recognition Benefit from Auxiliary Information in Training?
    Zhang, Qilin
    Hua, Gang
    Liu, Wei
    Liu, Zicheng
    Zhang, Zhengyou
    [J]. COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 65 - 80
  • [36] Redundancy can benefit learning: Evidence from word order and case marking
    Tal, Shira
    Arnon, Inbal
    [J]. COGNITION, 2022, 224
  • [37] Learning Sense Representation from Word Representation for Unsupervised Word Sense Disambiguation
    Wang, Jie
    Fu, Zhenxin
    Li, Moxin
    Zhang, Haisong
    Zhao, Dongyan
    Yan, Rui
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13947 - 13948
  • [38] Topic Models for Word Sense Disambiguation and Token-based Idiom Detection
    Li, Linlin
    Roth, Benjamin
    Sporleder, Caroline
    [J]. ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 1138 - 1147
  • [39] Can multilinguality improve Biomedical Word Sense Disambiguation?
    Duque, Andres
    Martinez-Romo, Juan
    Araujo, Lourdes
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 64 : 320 - 332
  • [40] MSS: Investigating the effectiveness of domain combinations and topic features for word sense disambiguation
    NTT Communication Science Laboratories, Japan
    [J]. SemEval - Int. Workshop Semant. Evaluation, Proc, (383-386):