On Speaker Adaptation of Long Short-Term Memory Recurrent Neural Networks

被引:0
|
作者
Miao, Yajie [1 ]
Metze, Florian [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Sch Comp Sci, Pittsburgh, PA 15213 USA
关键词
Long Short-Term Memory; recurrent neural network; acoustic modeling; speaker adaptation;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture specializing in modeling long-range temporal dynamics. On acoustic modeling tasks, LSTM-RNNs have shown better performance than DNNs and conventional RNNs. In this paper, we conduct an extensive study on speaker adaptation of LSTM-RNNs. Speaker adaptation helps to reduce the mismatch between acoustic models and testing speakers. We have two main goals for this study. First, on a benchmark dataset, the existing DNN adaptation techniques are evaluated on the adaptation of LSTM-RNNs. We observe that LSTM-RNNs can be effectively adapted by using speaker-adaptive (SA) front-end, or by inserting speaker-dependent (SD) layers. Second, we propose two adaptation approaches that implement the SD-layer-insertion idea specifically for LSTM-RNNs. Using these approaches, speaker adaptation improves word error rates by 3-4% relative over a strong LSTM-RNN baseline. This improvement is enlarged to 6-7% if we exploit SA features for further adaptation.
引用
收藏
页码:1101 / 1105
页数:5
相关论文
共 50 条
  • [1] SPEECH ENHANCEMENT USING LONG SHORT-TERM MEMORY BASED RECURRENT NEURAL NETWORKS FOR NOISE ROBUST SPEAKER VERIFICATION
    Kolbaek, Morten
    Tan, Zheng-Hua
    Jensen, Jesper
    [J]. 2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 305 - 311
  • [2] Detecting Overlapping Speech with Long Short-Term Memory Recurrent Neural Networks
    Geiger, Juergen T.
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1667 - 1671
  • [3] Simplified Gating in Long Short-term Memory (LSTM) Recurrent Neural Networks
    Lu, Yuzhen
    Salem, Fathi M.
    [J]. 2017 IEEE 60TH INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS (MWSCAS), 2017, : 1601 - 1604
  • [4] Long Short-Term Memory Based Recurrent Neural Networks for Collaborative Filtering
    Zou, Lixin
    Gu, Yulong
    Song, Jiaxing
    Liu, Weidong
    Yao, Yuan
    [J]. 2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [5] Long Short-Term Memory Recurrent Neural Networks for Antibacterial Peptide Identification
    Youmans, Michael
    Spainhour, Christian
    Qiu, Peng
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 498 - 502
  • [6] Normal-to-Lombard adaptation of speech synthesis using long short-term memory recurrent neural networks
    Bollepalli, Bajibabu
    Juvela, Lauri
    Airaksinen, Manu
    Valentini-Botinhao, Cassia
    Alku, Paavo
    [J]. SPEECH COMMUNICATION, 2019, 110 : 64 - 75
  • [7] Long and Short-Term Recommendations with Recurrent Neural Networks
    Devooght, Robin
    Bersini, Hugues
    [J]. PROCEEDINGS OF THE 25TH CONFERENCE ON USER MODELING, ADAPTATION AND PERSONALIZATION (UMAP'17), 2017, : 13 - 21
  • [8] Classification of Antibacterial Peptides Using Long Short-Term Memory Recurrent Neural Networks
    Youmans, Michael
    Spainhour, John C. G.
    Qiu, Peng
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (04) : 1134 - 1140
  • [9] Session Based Recommendations Using Recurrent Neural Networks - Long Short-Term Memory
    Dobrovolny, Michal
    Selamat, Ali
    Krejcar, Ondrej
    [J]. INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2021, 2021, 12672 : 53 - 65
  • [10] An analysis of Convolutional Long Short-Term Memory Recurrent Neural Networks for gesture recognition
    Tsironi, Eleni
    Barros, Pablo
    Weber, Cornelius
    Wermter, Stefan
    [J]. NEUROCOMPUTING, 2017, 268 : 76 - 86