An approach to vocabulary expansion for neural network language model by means of hierarchical clustering

被引:0
|
作者
Pavel, Dudarin [1 ]
Nadezhda, Yarushkina [1 ]
机构
[1] Ulyanovsk State Tech Univ, Informat Syst, Fac Informat Syst & Technol, Severny Venetz St 32, Ulyanovsk, Russia
基金
俄罗斯基础研究基金会;
关键词
NLP; Language model; Neural Network; RNN; ULMFiT; Clustering; Fuzzy graph clustering; Word-to-vec;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network language models become the main tool to solve tasks in NLP field. These models already have shown state-of-the-art results in classification, translation, named entity recognition and so on. Pre-trained models are distributed freely in the internet, and could be reused with help of transfer learning techniques. However, the real life problem's domain could differ from the origin domain which the network was trained. In this paper an approach to vocabulary expansion for neural network language model by means of hierarchical clustering is proposed. This technique allows to adopt prerained language model to a different domain. Firstly, tokens from the language model are hierarchically clustered. Then new words from problem's domain are matched to the tokens accordingly obtained hierarchy. In the experimental part the proposed approach is demonstrated on the slightly modified ULM-FiT language model.
引用
收藏
页码:614 / 618
页数:5
相关论文
共 50 条
  • [1] A Text Clustering Approach of Chinese News Based on Neural Network Language Model
    Fan, Zhaoxin
    Chen, Shuoying
    Zha, Li
    Yang, Jiadong
    [J]. INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2016, 44 (01) : 198 - 206
  • [2] A Text Clustering Approach of Chinese News Based on Neural Network Language Model
    Zhaoxin Fan
    Shuoying Chen
    Li Zha
    Jiadong Yang
    [J]. International Journal of Parallel Programming, 2016, 44 : 198 - 206
  • [3] Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR
    Khassanov, Yerbolat
    Chng, Eng Siong
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3343 - 3347
  • [4] Hierarchical clustering by means of model grouping
    Agostinelli, C
    Pellizzari, P
    [J]. FROM DATA AND INFORMATION ANALYSIS TO KNOWLEDGE ENGINEERING, 2006, : 246 - +
  • [5] Clustering approach based on hierarchical expansion for community detection of scientific collaboration network
    李晓慧
    Zheng Yanning
    [J]. High Technology Letters, 2016, 22 (04) : 419 - 425
  • [6] Clustering approach based on hierarchical expansion for community detection of scientific collaboration network
    Li X.
    Zheng Y.
    [J]. Li, Xiaohui (xhli@istic.ac.cn), 1600, Inst. of Scientific and Technical Information of China (22): : 419 - 425
  • [7] Clustering: A neural network approach
    Du, K. -L.
    [J]. NEURAL NETWORKS, 2010, 23 (01) : 89 - 107
  • [8] An artificial neural network approach for the language learning model
    Zulqurnain Sabir
    Salem Ben Said
    Qasem Al-Mdallal
    [J]. Scientific Reports, 13
  • [9] An artificial neural network approach for the language learning model
    Sabir, Zulqurnain
    Ben Said, Salem
    Al-Mdallal, Qasem
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [10] Large Vocabulary SOUL Neural Network Language Models
    Le, Hai-Son
    Oparin, Ilya
    Messaoudi, Abdel
    Allauzen, Alexandre
    Gauvain, Jean-Luc
    Yvon, Francois
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1480 - +