LIMITED-MEMORY BFGS OPTIMIZATION OF RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR SPEECH RECOGNITION

被引:0
|
作者
Liu, Xunying [1 ]
Liu, Shansong [1 ]
Sha, Jinze [2 ]
Yu, Jianwei [1 ]
Xu, Zhiyuan [2 ]
Chen, Xie [2 ]
Meng, Helen [1 ]
机构
[1] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Hong Kong, Hong Kong, Peoples R China
[2] Univ Cambridge, Engn Dept, Cambridge, England
关键词
recurrent neural network; language model; second order optimization; speech recognition; limited-memory BFGS;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recurrent neural network language models (RNNLM) have become an increasingly popular choice for state-of-the-art speech recognition systems. RNNLMs are normally trained by minimizing the cross entropy (CE) using the stochastic gradient descent (SGD) algorithm. The SGD method only uses first-order derivatives and no higher order gradient information is used to consider the correlation between model parameters. It is unable to fully capture the curvature of the error cost function. This can lead to slow convergence in model training. In this paper, a limited-memory Broyden Fletcher Goldfarb Shannon (L-BFGS) based second order optimization technique is proposed for RNNLMs. This method efficiently approximates the matrix-vector product between the inverse Hessian and gradient vector via a recursion over past gradients with a compact memory requirement. Consistent perplexity and error rate reductions are obtained over the SGD method on two speech recognition tasks: Switchboard English and Babel Cantonese. A faster convergence and speed up in RNNLM training time was also obtained.
引用
收藏
页码:6114 / 6118
页数:5
相关论文
共 50 条
  • [1] ADAPTIVE, LIMITED-MEMORY BFGS ALGORITHMS FOR UNCONSTRAINED OPTIMIZATION
    Boggs, Paul T.
    Byrd, Richard H.
    [J]. SIAM JOURNAL ON OPTIMIZATION, 2019, 29 (02) : 1282 - 1299
  • [2] Limited-memory BFGS with displacement aggregation
    Berahas, Albert S.
    Curtis, Frank E.
    Zhou, Baoyu
    [J]. MATHEMATICAL PROGRAMMING, 2022, 194 (1-2) : 121 - 157
  • [3] Limited-memory BFGS with displacement aggregation
    Albert S. Berahas
    Frank E. Curtis
    Baoyu Zhou
    [J]. Mathematical Programming, 2022, 194 : 121 - 157
  • [4] Investigating Bidirectional Recurrent Neural Network Language Models for Speech Recognition
    Chen, X.
    Ragni, A.
    Liu, X.
    Gales, M. J. F.
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 269 - 273
  • [5] BIDIRECTIONAL RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR AUTOMATIC SPEECH RECOGNITION
    Arisoy, Ebru
    Sethy, Abhinav
    Ramabhadran, Bhuvana
    Chen, Stanley
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5421 - 5425
  • [6] Limited-memory BFGS systems with diagonal updates
    Erway, Jennifer B.
    Marcia, Roummel F.
    [J]. LINEAR ALGEBRA AND ITS APPLICATIONS, 2012, 437 (01) : 333 - 344
  • [7] GAUSSIAN PROCESS LSTM RECURRENT NEURAL NETWORK LANGUAGE MODELS FOR SPEECH RECOGNITION
    Lam, Max W. Y.
    Chen, Xie
    Hu, Shoukang
    Yu, Jianwei
    Liu, Xunying
    Meng, Helen
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 7235 - 7239
  • [8] Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition
    Masumura, Ryo
    Asami, Taichi
    Oba, Takanobu
    Sakauchi, Sumitaka
    Ito, Akinori
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2557 - 2567
  • [9] Efficient Training and Evaluation of Recurrent Neural Network Language Models for Automatic Speech Recognition
    Chen, Xie
    Liu, Xunying
    Wang, Yongqiang
    Gales, Mark J. F.
    Woodland, Philip C.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2146 - 2157
  • [10] Analysis of limited-memory BFGS on a class of nonsmooth convex functions
    Asl, Azam
    Overton, Michael L.
    [J]. IMA JOURNAL OF NUMERICAL ANALYSIS, 2021, 41 (01) : 1 - 27