SPEAKER ADAPTIVE TRAINING USING DEEP NEURAL NETWORKS

Cited by: 0
Authors
Ochiai, Tsubasa [1, 2]
Matsuda, Shigeki [1]
Lu, Xugang [1]
Hori, Chiori [1]
Katagiri, Shigeru [2]
Affiliations
[1] Natl Inst Informat & Commun Technol, Spoken Language Commun Lab, Kyoto, Japan
[2] Doshisha Univ, Grad Sch Engn, Kyoto, Japan
Keywords
Speaker Adaptive Training; Deep Neural Network; Adaptation
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Among the many approaches to speaker adaptation, Speaker Adaptive Training (SAT) has been successfully applied to standard Hidden-Markov-Model (HMM) speech recognizers whose states are associated with Gaussian Mixture Models (GMMs). Meanwhile, recent studies on Speaker-Independent (SI) recognizer development have reported that a new type of HMM speech recognizer, which replaces GMMs with Deep Neural Networks (DNNs), outperforms GMM-HMM recognizers. Given these two lines of work, it is natural to seek further improvement of a preset DNN-HMM recognizer by employing SAT. In this paper, we propose a novel training scheme that applies SAT to an SI DNN-HMM recognizer. We implement the SAT scheme by allocating a Speaker-Dependent (SD) module to one of the intermediate layers of a seven-layer DNN, and evaluate its utility on TED Talks corpus data. Experimental results show that our speaker-adapted SAT-based DNN-HMM recognizer reduces the word error rate by 8.4% compared with a baseline SI DNN-HMM recognizer and, regardless of where the SD module is allocated, outperforms the conventional speaker adaptation scheme. The results also show that the inner layers of the DNN are more suitable for the SD module than the outer layers.
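The idea of allocating an SD module to one intermediate layer can be illustrated with a minimal sketch. This is not the authors' implementation: the class name `SATNetwork`, the layer sizes, the random initialization, the sigmoid and softmax choices, the counting of "seven layers" as seven weight layers, and the fallback to the shared layer for unseen speakers are all assumptions made for illustration. Only the forward pass is shown; training is omitted.

```python
import math
import random


def make_layer(n_in, n_out, rng):
    """Return (weights, biases) for one fully connected layer."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def affine(layer, x):
    """Compute W x + b for a single input vector."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def sigmoid(z):
    return [1.0 / (1.0 + math.exp(-v)) for v in z]


def softmax(z):
    peak = max(z)  # subtract the maximum for numerical stability
    exps = [math.exp(v - peak) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


class SATNetwork:
    """Hypothetical DNN with one speaker-dependent (SD) layer; all others are shared."""

    def __init__(self, sizes, sd_index, speakers, seed=0):
        rng = random.Random(seed)
        self.sd_index = sd_index
        # Shared (speaker-independent) layers, one per consecutive pair of sizes.
        self.shared = [make_layer(a, b, rng) for a, b in zip(sizes, sizes[1:])]
        # One SD copy of the chosen layer per speaker (assumption: full layer copy).
        n_in, n_out = sizes[sd_index], sizes[sd_index + 1]
        self.sd = {spk: make_layer(n_in, n_out, rng) for spk in speakers}

    def forward(self, x, speaker):
        last = len(self.shared) - 1
        for i, layer in enumerate(self.shared):
            if i == self.sd_index:
                # Swap in the speaker's module; unseen speakers fall back to
                # the shared layer (an illustrative choice, not from the paper).
                layer = self.sd.get(speaker, layer)
            z = affine(layer, x)
            x = softmax(z) if i == last else sigmoid(z)
        return x


# Eight sizes give seven weight layers; the dimensions are toy values.
SIZES = [4, 5, 5, 5, 5, 5, 5, 3]
net = SATNetwork(SIZES, sd_index=3, speakers=["spk_a", "spk_b"])
frame = [0.1, 0.2, 0.3, 0.4]
posteriors_a = net.forward(frame, "spk_a")
posteriors_b = net.forward(frame, "spk_b")
```

Changing `sd_index` moves the SD module between inner and outer layers, which is the kind of allocation comparison the abstract describes.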
Pages: 5