RNN language model with word clustering and class-based output layer

被引:0
|
作者
Yongzhe Shi
Wei-Qiang Zhang
Jia Liu
Michael T Johnson
机构
[1] Tsinghua University,Tsinghua National Laboratory for Information Science and Technology, Department of Electronic Engineering
[2] Marquette University,Department of Electrical Engineering
关键词
Brown word clustering; RNN language model; Speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
The recurrent neural network language model (RNNLM) has shown significant promise for statistical language modeling. In this work, a new class-based output layer method is introduced to further improve the RNNLM. In this method, word class information is incorporated into the output layer by utilizing the Brown clustering algorithm to estimate a class-based language model. Experimental results show that the new output layer with word clustering not only improves the convergence obviously but also reduces the perplexity and word error rate in large vocabulary continuous speech recognition.
引用
收藏
相关论文
共 50 条
  • [1] RNN language model with word clustering and class-based output layer
    Shi, Yongzhe
    Zhang, Wei-Qiang
    Liu, Jia
    Johnson, Michael T.
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2013,
  • [2] EXPLOITING DIFFERENT WORD CLUSTERINGS FOR CLASS-BASED RNN LANGUAGE MODELING IN SPEECH RECOGNITION
    Song, Minguang
    Zhao, Yunxin
    Wang, Shaojun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5735 - 5739
  • [3] An Empirical Investigation of Word Class-Based Features for Natural Language Understanding
    Celikyilmaz, Asli
    Sarikaya, Ruhi
    Jeong, Minwoo
    Deoras, Anoop
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (06) : 994 - 1005
  • [4] Language model based on word clustering
    Yuan, Lichi
    [J]. PACLIC 20: Proceedings of the 20th Pacific Asia Conference on Language, Information and Computation, 2006, : 394 - 397
  • [5] A class-based approach to word alignment
    Ker, SJ
    Chang, JS
    [J]. COMPUTATIONAL LINGUISTICS, 1997, 23 (02) : 313 - 343
  • [6] Class-based LSTM Russian Language Model with Linguistic Information
    Kipyatkova, Irina
    Karpov, Alexey
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 2470 - 2474
  • [7] BAYESIAN CLASS-BASED LANGUAGE MODELS
    Su, Yi
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5564 - 5567
  • [8] A class-based logic language for ontologies
    Benslimane, D
    Hacid, MS
    Terzi, E
    Toumani, F
    [J]. FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2002, 2522 : 56 - 70
  • [9] Unsupervised class-based language model adaptation for spontaneous speech recognition
    Yokoyama, T
    Shinozaki, T
    Iwano, K
    Furui, S
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 236 - 239
  • [10] Statistical language modeling with a class-based n-multigram model
    Deligne, S
    Sagisaka, Y
    [J]. COMPUTER SPEECH AND LANGUAGE, 2000, 14 (03): : 261 - 279