Improving WordNet using Word Embeddings

被引:2
|
作者
Chiru, Costin-Gabriel [1 ]
Truica, Ciprian-Octavian [1 ]
Apostol, Elena-Simona [1 ]
Ionescu, Alexandru [1 ]
机构
[1] Univ Politehn Bucuresti, Fac Automat Control & Comp, Comp Sci & Engn Dept, Bucharest, Romania
关键词
WordNet; Word2Vec; semantic similarity; semantic change; word embeddings;
D O I
10.1109/SYNASC54541.2021.00030
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The main objective of this paper is to create a proof of concept regarding the improvement of the human-generated database WordNet using computer-generated information from Word2Vec. Thus, we change the WordNet content, using the information from existing corpora. The main method used to achieve this goal is by comparing the results of path algorithms for computing the semantic similarities between WordNet concepts (such as Path, Wu and Palmer, or Leacock and Chodorow similarities), with cosine similarity between the Word2Vec vectors of the same concepts. One way to improve WordNet is by adding new concepts from the Word2Vec corpus which have strong connections with existing words from WordNet. Another way to improve it is by updating its existing connections to underline semantic change. Our experimental results prove that the method we propose may be used to improve the number of concepts and the quality of links between synsets in WordNet, creating a more meaningful semantic resource.
引用
收藏
页码:121 / 128
页数:8
相关论文
共 50 条
  • [1] Intrinsic Evaluation of Lithuanian Word Embeddings Using WordNet
    Kapociute-Dzikiene, Jurgita
    Damasevicius, Robertas
    [J]. ARTIFICIAL INTELLIGENCE AND ALGORITHMS IN INTELLIGENT SYSTEMS, 2019, 764 : 394 - 404
  • [2] Improving Vietnamese WordNet using word embedding
    Khang Nhut Lam
    Tuan Huynh To
    Thong Tri Tran
    Kalita, Jugal
    [J]. NLPIR 2019: 2019 3RD INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, 2019, : 110 - 114
  • [3] Improving Word Embeddings Using Kernel PCA
    Gupta, Vishwani
    Giesselbach, Sven
    Rueping, Stefan
    Bauckhage, Christian
    [J]. 4TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP (REPL4NLP-2019), 2019, : 200 - 208
  • [4] WordNet Embeddings
    Saedi, Chakaveh
    Branco, Antonio
    Rodrigues, Joao Antonio
    Silva, Joao Ricardo
    [J]. REPRESENTATION LEARNING FOR NLP, 2018, : 122 - 131
  • [5] Improving accuracy of an existing semantic word labelling tool using word embeddings
    Sanjurjo-Gonzalez, Hugo
    [J]. PROCEEDINGS OF 2021 16TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2021), 2021,
  • [6] Application of WordNet and word embeddings in the development of prototypes for automatic language generation
    Dominguez Vazquez, Maria Jose
    [J]. LINGUAMATICA, 2020, 12 (02): : 71 - 80
  • [7] Improving document representation using KPCA and clustered word embeddings
    Gupta, Aakansha
    Katarya, Rahul
    [J]. 2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 514 - 517
  • [8] Improving Word Embeddings for Antonym Detection Using Thesauri and SentiWordNet
    Dou, Zehao
    Wei, Wei
    Wan, Xiaojun
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2018, PT II, 2018, 11109 : 67 - 79
  • [9] Improving Word Recognition using Multiple Hypotheses and Deep Embeddings
    Bansal, Siddhant
    Krishnan, Praveen
    Jawahar, C., V
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9499 - 9506
  • [10] Improving seller–customer communication process using word embeddings
    Malik Muhammad Saad Missen
    Aqsa Naeem
    Hina Asmat
    Nadeem Salamat
    Nadeem Akhtar
    Mickaël Coustaty
    V. B. Surya Prasath
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2021, 12 : 2257 - 2272