A Naive Bayes Approach to Cross-Lingual Word Sense Disambiguation and Lexical Substitution

Cited by: 0
Authors
Pinto, David [1]
Vilarino, Darnes [1]
Balderas, Carlos [1]
Tovar, Mireya [1]
Beltran, Beatriz [1]
Affiliations
[1] B Autonomous Univ Puebla, Fac Comp Sci, Puebla, Mexico
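Source
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes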
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Word Sense Disambiguation (WSD) is considered one of the most important problems in Natural Language Processing [1]. It is claimed that WSD is essential for applications that require language comprehension modules, such as search engines, machine translation systems, automatic answer machines, second life agents, etc. Moreover, with the huge amount of information on the Internet and the fact that this information is continuously growing in different languages, we are encouraged to deal with cross-lingual scenarios where WSD systems are also needed. On the other hand, Lexical Substitution (LS) refers to the process of finding a substitute word for a source word in a given sentence. The LS task needs to be approached by first disambiguating the source word; therefore, these two tasks (WSD and LS) are related. In this paper, we present a naive approach to tackle the problem of cross-lingual WSD and cross-lingual lexical substitution. We use a bilingual statistical dictionary, calculated with Giza++ from the EUROPARL parallel corpus, to estimate the probability that a source word is translated to a target word (which is assumed to be the correct sense of the source word, but in a different language). Two versions of the probabilistic model are tested: unweighted and weighted. The results were compared with those of an international competition, and the approach performed well.
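The abstract does not give the scoring formula, so the following is only a minimal sketch of how a naive Bayes ranking over candidate translations might look. Everything in it is an assumption made for illustration: the function name `score_candidates`, the toy probabilities (a real table would come from Giza++ run on EUROPARL), the smoothing floor `SMOOTH`, and in particular the weighted variant, which here scales each context word by the inverse of its distance to the ambiguous word. The paper's actual weighting scheme may differ.

```python
import math

SMOOTH = 1e-6  # assumed floor probability for unseen (word, translation) pairs


def score_candidates(context, position, prior, likelihood, weighted=False):
    """Rank candidate translations of context[position] by naive Bayes log-score.

    context    -- list of source-language tokens
    position   -- index of the ambiguous word in `context`
    prior      -- dict: candidate translation t -> P(t | ambiguous word)
    likelihood -- dict: candidate translation t -> {source word w: P(w | t)}
    weighted   -- if True, scale each context word's log-probability by
                  1 / distance to the ambiguous word (an assumed scheme)
    """
    scores = {}
    for t, p_t in prior.items():
        total = math.log(p_t)
        for i, w in enumerate(context):
            if i == position:
                continue  # the ambiguous word itself is covered by the prior
            p = likelihood.get(t, {}).get(w, SMOOTH)
            weight = 1.0 / abs(i - position) if weighted else 1.0
            total += weight * math.log(p)
        scores[t] = total
    # Best candidate first; the full ranking can serve as a substitution list.
    return sorted(scores, key=scores.get, reverse=True)


# Toy English -> Spanish example with made-up probabilities.
context = ["the", "river", "bank", "was", "muddy"]
prior = {"banco": 0.7, "orilla": 0.3}
likelihood = {
    "banco": {"the": 0.1, "money": 0.2, "was": 0.05},
    "orilla": {"the": 0.1, "river": 0.3, "was": 0.05, "muddy": 0.1},
}

unweighted_ranking = score_candidates(context, 2, prior, likelihood)
weighted_ranking = score_candidates(context, 2, prior, likelihood, weighted=True)
print(unweighted_ranking, weighted_ranking)
```

In this toy setting the context words "river" and "muddy" outweigh the higher prior of "banco", so both variants place "orilla" first. With no context at all, the ranking falls back to the prior.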
Pages: 352-361
Page count: 10