On Speaker Adaptation of Long Short-Term Memory Recurrent Neural Networks

Authors
Miao, Yajie [1]
Metze, Florian [1]
Affiliations
[1] Carnegie Mellon Univ, Language Technol Inst, Sch Comp Sci, Pittsburgh, PA 15213 USA
Keywords
Long Short-Term Memory; recurrent neural network; acoustic modeling; speaker adaptation
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that specializes in modeling long-range temporal dynamics. On acoustic modeling tasks, LSTM-RNNs have shown better performance than DNNs and conventional RNNs. In this paper, we conduct an extensive study on speaker adaptation of LSTM-RNNs. Speaker adaptation helps to reduce the mismatch between acoustic models and testing speakers. We have two main goals for this study. First, on a benchmark dataset, we evaluate existing DNN adaptation techniques on the adaptation of LSTM-RNNs. We observe that LSTM-RNNs can be effectively adapted by using a speaker-adaptive (SA) front-end, or by inserting speaker-dependent (SD) layers. Second, we propose two adaptation approaches that implement the SD-layer-insertion idea specifically for LSTM-RNNs. Using these approaches, speaker adaptation improves word error rates by 3-4% relative over a strong LSTM-RNN baseline. This improvement increases to 6-7% relative if we additionally exploit SA features for further adaptation.
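The SD-layer-insertion idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and the abstract does not specify where the layers are placed or how they are trained. The sketch assumes one common form of the idea: a per-speaker affine transform applied to each feature frame, initialized to the identity so that an unadapted speaker passes through unchanged. The class name `SpeakerDependentLayer`, its methods, and the speaker IDs are all hypothetical.

```python
from typing import Dict, List, Tuple

Vector = List[float]
Matrix = List[List[float]]


def identity(dim: int) -> Matrix:
    """Return a dim x dim identity matrix as nested lists."""
    return [[1.0 if i == j else 0.0 for j in range(dim)] for i in range(dim)]


class SpeakerDependentLayer:
    """Hypothetical per-speaker affine layer y = W x + b.

    Assumption: one (W, b) pair is stored per speaker, while the rest of
    the acoustic model (not shown here) stays speaker-independent.
    """

    def __init__(self, dim: int) -> None:
        self.dim = dim
        self.weights: Dict[str, Matrix] = {}
        self.biases: Dict[str, Vector] = {}

    def params_for(self, speaker: str) -> Tuple[Matrix, Vector]:
        # A new speaker starts from the identity transform, so the output
        # equals the input until the parameters are adapted.
        if speaker not in self.weights:
            self.weights[speaker] = identity(self.dim)
            self.biases[speaker] = [0.0] * self.dim
        return self.weights[speaker], self.biases[speaker]

    def forward(self, speaker: str, frame: Vector) -> Vector:
        """Apply the speaker's affine transform to one feature frame."""
        w, b = self.params_for(speaker)
        return [
            sum(w_ij * x_j for w_ij, x_j in zip(row, frame)) + b_i
            for row, b_i in zip(w, b)
        ]


layer = SpeakerDependentLayer(dim=3)
frame = [0.5, -1.0, 2.0]

# Before adaptation the layer is an identity mapping.
unadapted = layer.forward("spk1", frame)

# Stand-in for adaptation: shift only spk1's bias (values are made up).
layer.params_for("spk1")[1][0] = 0.25
adapted_spk1 = layer.forward("spk1", frame)
untouched_spk2 = layer.forward("spk2", frame)
```

In a real system the per-speaker parameters would be estimated from that speaker's adaptation data by gradient descent while the shared network weights stay fixed; the manual bias shift above only stands in for that step.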
Pages: 1101-1105
Page count: 5