Topic Models for Word Sense Disambiguation and Token-based Idiom Detection

被引:0
|
作者
Li, Linlin [1 ]
Roth, Benjamin [1 ]
Sporleder, Caroline [1 ]
机构
[1] Univ Saarland, Postfach 15 11 50, D-66041 Saarbrucken, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a probabilistic model for sense disambiguation which chooses the best sense based on the conditional probability of sense paraphrases given a context. We use a topic model to decompose this conditional probability into two conditional probabilities with latent variables. We propose three different instantiations of the model for solving sense disambiguation problems with different degrees of resource availability. The proposed models are tested on three different tasks: coarse-grained word sense disambiguation, fine-grained word sense disambiguation, and detection of literal vs. non-literal usages of potentially idiomatic expressions. In all three cases, we outperform state-of-the-art systems either quantitatively or statistically significantly.
引用
收藏
页码:1138 / 1147
页数:10
相关论文
共 50 条
  • [1] Knowledge-Based Word Sense Disambiguation Using Topic Models
    Chaplot, Devendra Singh
    Salakhutdinov, Ruslan
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 5062 - 5069
  • [2] Word Sense Disambiguation based on Sequence Topic Model using sense dependency
    Yang, Qi
    Li, Ruixuan
    Li, Yuhua
    Gu, Xiwu
    [J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [3] Word Sense Disambiguation using Author Topic Model
    Kaneishi, Shougo
    Tajima, Takuya
    [J]. 2014 IEEE INTERNATIONAL SYMPOSIUM ON INDEPENDENT COMPUTING (ISIC), 2014, : 78 - 83
  • [4] Topic Modeling and Word Sense Disambiguation on the Ancora corpus
    Izquierdo, Ruben
    Postma, Marten
    Vossen, Piek
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2015, (55): : 15 - 22
  • [5] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [6] Word Sense Disambiguation by Context Detection
    Rahman, Mohammad Marufur
    Khan, Saeed Anwar
    Hasan, K. M. Azharul
    [J]. 2019 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2019,
  • [7] Token-based Plagiarism Detection for Metamodels
    Saglam, Timur
    Hahner, Sebastian
    Wittler, Jan Willem
    Kuehn, Thomas
    [J]. ACM/IEEE 25TH INTERNATIONAL CONFERENCE ON MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS, MODELS 2022 COMPANION, 2022, : 138 - 141
  • [8] Correlation Based Word Sense Disambiguation
    Agarwal, Madhavi
    Bajpai, Jyoti
    [J]. 2014 SEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2014, : 382 - 386
  • [9] WordNet Based Word Sense Disambiguation
    Sieminski, Andrzej
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT II: THIRD INTERNATIONAL CONFERENCE, ICCCI 2011, 2011, 6923 : 405 - 414
  • [10] Graph Based Word Sense Disambiguation
    Koppula, Neeraja
    Rani, B. Padmaja
    Rao, Koppula Srinivas
    [J]. PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS, ICCII 2016, 2017, 507 : 665 - 670