Supervised word sense disambiguation using new features based on word embeddings

被引:3
|
作者
Sadi, Majid Fahandezi [1 ]
Ansari, Ebrahim [2 ,3 ]
Afsharchi, Mohsen [1 ]
机构
[1] Univ Zanjan, Dept Comp Engn, Univ Zanjan Blvd, Zanjan, Iran
[2] IASBS, Dept Comp Sci & Informat Technol, Zanjan, Iran
[3] IASBS, Res Ctr Basic Sci & Modern Technol RBST, Zanjan, Iran
关键词
Word sense disambiguation; Word embedding; Supervised learning; Support vector machine;
D O I
10.3233/JIFS-182868
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised Word Sense Disambiguation (WSD) systems use features of the target word and its context to learn about all possible samples in an annotated dataset. Recently, word embeddings have emerged as a powerful feature in many NLP tasks. In supervised WSD, word embeddings can be used as a high-quality feature representing the context of an ambiguous word. In this paper, four improvements to existing state-of-the-art WSD methods are proposed. First, we propose a new model for assigning vector coefficients for a more precise context representation. Second, we apply a PCA dimensionality reduction process to find a better transformation of feature matrices and train a more informative model. Third, a new weighting scheme is suggested to tackle the problem of unbalanced data in standard WSD datasets and finally, a novel idea is presented to combine word embedding features extracted from different independent corpora, which uses a voting aggregator among available trained models. All of these proposals individually improve disambiguation performance on Standard English lexical sample tasks, and using the combination of all proposed ideas makes a significant improvement in the accuracy score.
引用
收藏
页码:1467 / 1476
页数:10
相关论文
共 50 条
  • [1] Unsupervised Word Sense Disambiguation Using Word Embeddings
    Moradi, Behzad
    Ansari, Ebrahim
    Zabokrtsky, Zdenek
    [J]. PROCEEDINGS OF THE 2019 25TH CONFERENCE OF OPEN INNOVATIONS ASSOCIATION (FRUCT), 2019, : 228 - 233
  • [2] Domain Adaptation for Word Sense Disambiguation Using Word Embeddings
    Komiya, Kanako
    Suzuki, Shota
    Sasaki, Minoru
    Shinnou, Hiroyuki
    Okumura, Manabu
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2017), PT I, 2018, 10761 : 195 - 206
  • [3] Biomedical Word Sense Disambiguation with Word Embeddings
    Antunes, Rui
    Matos, Sergio
    [J]. 11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 273 - 279
  • [4] Word sense disambiguation: an evaluation study of semi-supervised approaches with word embeddings
    Sousa, Samuel
    Milios, Evangelos
    Berton, Lilian
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] A New Approach to the Supervised Word Sense Disambiguation
    Agre, Gennady
    Petrov, Daniel
    Keskinova, Simona
    [J]. ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2018, 2018, 11089 : 3 - 15
  • [6] Word Sense Disambiguation for 158 Languages using Word Embeddings Only
    Logacheva, Varvara
    Teslenko, Denis
    Shelmanov, Artem
    Remus, Steffen
    Ustalov, Dmitry
    Kutuzov, Andrey
    Artemova, Ekaterina
    Biemann, Chris
    Ponzetto, Simone Paolo
    Panchenko, Alexander
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5943 - 5952
  • [7] Word Embeddings of Monosemous Words in Dictionary for Word Sense Disambiguation
    Sasaki, Minoru
    [J]. SEMAPRO 2018: THE TWELFTH INTERNATIONAL CONFERENCE ON ADVANCES IN SEMANTIC PROCESSING, 2018, : 4 - 7
  • [8] The Effect of the Number of Features to Supervised Chinese Word Sense Disambiguation
    Liu, Pengyuan
    [J]. JOURNAL OF COMPUTERS, 2013, 8 (02) : 313 - 318
  • [9] Combining Local and Global Features in Supervised Word Sense Disambiguation
    Lei, Xue
    Cai, Yi
    Li, Qing
    Xie, Haoran
    Leung, Ho-fung
    Wang, Fu Lee
    [J]. WEB INFORMATION SYSTEMS ENGINEERING, WISE 2017, PT II, 2017, 10570 : 117 - 131
  • [10] Effect of Supervised Sense Disambiguation Model Using Machine Learning Technique and Word Embedding in Word Sense Disambiguation
    Mahajan, Rupesh
    Kokane, Chandrakant
    Pathak, Kishor
    Kodmelwar, Manohar
    Wagh, Kapil
    Bhandari, Mahesh
    [J]. JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (01) : 436 - 443