Subword-Based Compact Reconstruction for Open-Vocabulary Neural Word Embeddings

被引:0
|
作者
Sasaki, Shota [1 ,2 ]
Suzuki, Jun [1 ,2 ]
Inui, Kentaro [1 ,2 ]
机构
[1] RIKEN, Ctr Adv Intelligence Project, Sendai, Miyagi 9808579, Japan
[2] Tohoku Univ, RIKEN, Sendai, Miyagi 9808579, Japan
关键词
Task analysis; Memory management; Semantics; Indexes; Vocabulary; Syntactics; Speech processing; Neural word embeddings; open vocabulary; subwords;
D O I
10.1109/TASLP.2021.3125133
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The methodology of neural word embeddings has become an important fundamental resource for tackling many applications in the artificial intelligence (AI) research field. They have successfully been proven to capture high-quality syntactic and semantic relationships in a vector space. Despite their significant impact, neural word embeddings have several disadvantages. In this paper, we focus on two issues regarding well-trained word embeddings: (i) the massive memory requirement and (ii) the inapplicability of out-of-vocabulary (OOV) words. To overcome these two issues, we propose a method of reconstructing pre-trained word embeddings by using subword information that effectively represents a large number of subword embeddings in a considerably small fixed space while preventing quality degradation from the original word embeddings. The key techniques of our method are twofold: memory-shared embeddings and a variant of the key-value-query self-attention mechanism. Our experiments show that our reconstructed subword-based word embeddings can successfully imitate well-trained word embeddings in a small fixed space while preventing quality degradation across several linguistic benchmark datasets and can simultaneously predict effective embeddings of OOV words. We also demonstrate the effectiveness of our reconstruction method when it is applied to downstream tasks, such as named entity recognition and natural language inference tasks.
引用
收藏
页码:3551 / 3564
页数:14
相关论文
共 50 条
  • [41] Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation
    Yepes, Antonio Jimeno
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 73 : 137 - 147
  • [42] POLYHEDRON-BASED GRAPH NEURAL NETWORK FOR COMPACT BUILDING MODEL RECONSTRUCTION
    Chen, Zhaiyu
    Shi, Yilei
    Xiong, Zhitong
    Zhu, Xiao Xiang
    [J]. IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 923 - 926
  • [43] FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection
    Zhang, Dongmei
    Li, Chang
    Zhang, Renrui
    Xie, Shenghao
    Xue, Wei
    Xie, Xiaodong
    Zhang, Shanghang
    [J]. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 16723 - 16731
  • [44] Punctuation Restoration for Ukrainian Broadcast Speech Recognition System based on Bidirectional Recurrent Neural Network and Word Embeddings
    Sazhok, Mykola
    Poltieva, Anna
    Robeiko, Valentyna
    Seliukh, Ruslan
    Fedoryn, Dmytro
    [J]. COLINS 2021: COMPUTATIONAL LINGUISTICS AND INTELLIGENT SYSTEMS, VOL I, 2021, 2870
  • [45] Convolutional neural networks for chemical-disease relation extraction are improved with character-based word embeddings
    Dat Quoc Nguyen
    Verspoor, Karin
    [J]. SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), 2018, : 129 - 136
  • [46] RETRACTED: Multi-neural network-based sentiment analysis of food reviews based on character and word embeddings (Retracted Article)
    Li, Yong
    Jin, Qingyu
    Zuo, Min
    Li, Haisheng
    Yang, Xiaojun
    Zhang, Qingchuan
    Liu, Xinliang
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL ENGINEERING EDUCATION, 2020,
  • [47] Development of HMM/neural network-based medium-vocabulary isolated-word Lithuanian speech recognition system
    Filipovic, M
    Lipeika, A
    [J]. INFORMATICA, 2004, 15 (04) : 465 - 474
  • [48] Design of an English vocabulary e-learning recommendation system based on word bag model and recurrent neural network algorithm
    Xian, Hui
    Wu, Shukun
    [J]. ENTERTAINMENT COMPUTING, 2024, 50
  • [49] Utilizing Convolutional Neural Networks and Word Embeddings for Early-Stage Recognition of Persuasion in Chat-Based Social Engineering Attacks
    Tsinganos, Nikolaos
    Mavridis, Ioannis
    Gritzalis, Dimitris
    [J]. IEEE ACCESS, 2022, 10 : 108517 - 108529
  • [50] Solving the Defect in Application of Compact Abating Probability to Convolutional Neural Network Based Open Set Recognition
    Sun, Xiangyuan
    Li, Xiaoyong
    Ren, Kaijun
    Song, Junqiang
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 856 - 863