Joint Learning of Sense and Word Embeddings

被引:0
|
作者
Alsuhaibani, Mohammed [1 ]
Bollegala, Danushka [1 ]
机构
[1] Univ Liverpool, Dept Comp Sci, Liverpool, Merseyside, England
关键词
Sense Embeddings; Word embeddings; Labelled Data; Unlabelled Data;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Methods for learning lower-dimensional representations (embeddings) of words using unlabelled data have received a renewed interested due to their myriad success in various Natural Language Processing (NLP) tasks. However, despite their success, a common deficiency associated with most word embedding learning methods is that they learn a single representation for a word, ignoring the different senses of that word (polysemy). To address the polysemy problem, we propose a method that jointly learns sense-aware word embeddings using both unlabelled and sense-tagged text corpora. In particular, our proposed method can learn both word and sense embeddings by efficiently exploiting both types of resources. Our quantitative and qualitative experimental results using unlabelled text corpus with (a) manually annotated word senses, and (b) pseudo annotated senses demonstrate that the proposed method can correctly learn the multiple senses of an ambiguous word. Moreover, the word embeddings learnt by our proposed method outperform several previously proposed competitive word embedding learning methods on word similarity and short-text classification benchmark datasets.
引用
收藏
页码:223 / 229
页数:7
相关论文
共 50 条
  • [21] Evaluation of Stacked Embeddings for Arabic Word Sense Disambiguation
    Laatar, Rim
    Aloulou, Chafik
    Belguith, Lamia Hadrich
    [J]. COMPUTACION Y SISTEMAS, 2023, 27 (02): : 379 - 388
  • [22] Probabilistic FastText for Multi-Sense Word Embeddings
    Athiwaratkun, Ben
    Wilson, Andrew Gordon
    Anandkumar, Anima
    [J]. PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL), VOL 1, 2018, : 1 - 11
  • [23] Unsupervised Joint Training of Bilingual Word Embeddings
    Marie, Benjamin
    Fujita, Atsushi
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 3224 - 3230
  • [24] SENSEMBERT: Context-Enhanced Sense Embeddings for Multilingual Word Sense Disambiguation
    Scarlini, Bianca
    Pasini, Tommaso
    Navigli, Roberto
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8758 - 8765
  • [25] Zero-shot Word Sense Disambiguation using Sense Definition Embeddings
    Kumar, Sawan
    Jat, Sharmistha
    Saxena, Karan
    Talukdar, Partha
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5670 - 5681
  • [26] Word sense induction using word embeddings and community detection in complex networks
    Correa, Edilson A., Jr.
    Amancio, Diego R.
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 523 : 180 - 190
  • [27] Supervised word sense disambiguation using new features based on word embeddings
    Sadi, Majid Fahandezi
    Ansari, Ebrahim
    Afsharchi, Mohsen
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (01) : 1467 - 1476
  • [28] A Neural Generative Model for Joint Learning Topics and Topic-Specific Word Embeddings
    Zhu, Lixing
    He, Yulan
    Zhou, Deyu
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2020, 8 : 471 - 485
  • [29] Large scale image annotation: learning to rank with joint word-image embeddings
    Jason Weston
    Samy Bengio
    Nicolas Usunier
    [J]. Machine Learning, 2010, 81 : 21 - 35
  • [30] Large scale image annotation: learning to rank with joint word-image embeddings
    Weston, Jason
    Bengio, Samy
    Usunier, Nicolas
    [J]. MACHINE LEARNING, 2010, 81 (01) : 21 - 35