RNN language model with word clustering and class-based output layer

被引:0
|
作者
Yongzhe Shi
Wei-Qiang Zhang
Jia Liu
Michael T Johnson
机构
[1] Tsinghua University,Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering
[2] Marquette University,Department of Electrical Engineering
关键词
Brown word clustering; RNN language model; Speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
The recurrent neural network language model (RNNLM) has shown significant promise for statistical language modeling. In this work, a new class-based output layer method is introduced to further improve the RNNLM. In this method, word class information is incorporated into the output layer by utilizing the Brown clustering algorithm to estimate a class-based language model. Experimental results show that the new output layer with word clustering not only improves the convergence obviously but also reduces the perplexity and word error rate in large vocabulary continuous speech recognition.
引用
收藏
相关论文
共 50 条
  • [21] Word clustering based on similarity and vari-gram language model
    Yuan, LC
    Zhong, YX
    [J]. ICCC2004: Proceedings of the 16th International Conference on Computer Communication Vol 1and 2, 2004, : 1222 - 1226
  • [22] Enhancement of the word2vec Class-Based Language Modeling by Optimizing the Features Vector using PCA
    Abdulhameed, Tiba Zaki
    Zitouni, Imed
    Abdel-Qader, Ikhlas
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 866 - 870
  • [23] TOKEN-LEVEL INTERPOLATION FOR CLASS-BASED LANGUAGE MODELS
    Levit, Michael
    Stolcke, Andreas
    Chang, Shuangyu
    Parthasarathy, Sarangarajan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5426 - 5430
  • [24] Efficient class-based language modelling for very large vocabularies
    Whittaker, EWD
    Woodland, PC
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 545 - 548
  • [25] Cross-lingual projection for class-based language models
    Gfeller, Beat
    Schogol, Vlad
    Hall, Keith
    [J]. PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2016), VOL 2, 2016, : 83 - 88
  • [26] Constrained clustering method for class-based storage location assignment in warehouse
    Yang, Chao-Lung
    Thi Phuong Quyen Nguyen
    [J]. INDUSTRIAL MANAGEMENT & DATA SYSTEMS, 2016, 116 (04) : 667 - 689
  • [27] A chemical class-based approach to predictive model generation
    Miller, DW
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02): : 568 - 578
  • [28] Model-Based and Class-Based Fusion of Multisensor Data
    Tsanousa, Athina
    Chatzimichail, Angelos
    Meditskos, Georgios
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    [J]. MULTIMEDIA MODELING (MMM 2020), PT II, 2020, 11962 : 614 - 625
  • [29] Research on Class-Based Storage Strategies for Flood Control Materials Based on Grey Clustering
    Wang, Wei
    Huang, Li
    Le, Weiwei
    Zhang, Yu
    Ji, Peng
    Proverbs, David
    [J]. WATER, 2018, 10 (11)
  • [30] Word sense disambiguation using semantic kernels with class-based term values
    Altinel, Berna
    Ganiz, Murat Can
    Sipal, Bilge
    Erkaya, Erencan
    Yucedag, Onur Can
    Dogan, Muhammed Ali
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (04) : 3180 - 3194