SPEAKER ADAPTIVE TRAINING USING DEEP NEURAL NETWORKS

被引：0

作者：

Ochiai, Tsubasa ^{[1
,2
]}

Matsuda, Shigeki ^{[1
]}

Lu, Xugang ^{[1
]}

Hori, Chiori ^{[1
]}

Katagiri, Shigeru ^{[2
]}

机构：

[1] Natl Inst Informat & Commun Technol, Spoken Language Commun Lab, Kyoto, Japan

[2] Doshisha Univ, Grad Sch Engn, Kyoto, Japan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

Speaker Adaptative Training; Deep Neural Network; ADAPTATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Among many speaker adaptation embodiments, Speaker Adaptive Training (SAT) has been successfully applied to a standard Hidden-Markov-Model (HMM) speech recognizer, whose state is associated with Gaussian Mixture Models (GMMs). On the other hand, recent studies on Speaker-Independent (SI) recognizer development have reported that a new type of HMM speech recognizer, which replaces GMMs with Deep Neural Networks (DNNs), outperforms GMM-HMM recognizers. Along these two lines, it is natural to conceive of further improvement to a preset DNN-HMM recognizer by employing SAT. In this paper, we propose a novel training scheme that applies SAT to a SI DNN-HMM recognizer. We then implement the SAT scheme by allocating a Speaker-Dependent (SD) module to one of the intermediate layers of a seven-layer DNN, and elaborate its utility over TED Talks corpus data. Experiment results show that our speaker-adapted SAT-based DNN-HMM recognizer reduces the word error rate by 8.4% more than that of a baseline SI DNN-HMM recognizer, and (regardless of the SD module allocation) outperforms the conventional speaker adaptation scheme. The results also show that the inner layers of DNN are more suitable for the SD module than the outer layers.

引用

页数：5

共 50 条

[41] Deep Neural Networks for Multiple Speaker Detection and Localization
He, Weipeng
Motlicek, Petr
Odobez, Jean-Marc
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 74 - 79
[42] Speaker2Vec: Unsupervised Learning and Adaptation of a Speaker Manifold using Deep Neural Networks with an Evaluation on Speaker Segmentation
Jati, Arindam
Georgiou, Panayiotis
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3567 - 3571
[43] Exploiting nonlinear dendritic adaptive computation in training deep Spiking Neural Networks
Shen, Guobin
Zhao, Dongcheng
Zeng, Yi
NEURAL NETWORKS, 2024, 170 : 190 - 201
[44] AdaXod: a new adaptive and momental bound algorithm for training deep neural networks
Liu, Yuanxuan
Li, Dequan
JOURNAL OF SUPERCOMPUTING, 2023, 79 (15): : 17691 - 17715
[45] AdaXod: a new adaptive and momental bound algorithm for training deep neural networks
Yuanxuan Liu
Dequan Li
The Journal of Supercomputing, 2023, 79 : 17691 - 17715
[46] Closing the Generalization Gap of Adaptive Gradient Methods in Training Deep Neural Networks
Chen, Jinghui
Zhou, Dongruo
Tang, Yiqi
Yang, Ziyan
Cao, Yuan
Gu, Quanquan
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 3267 - 3275
[47] Adaptive Control of Robotic Manipulators using Deep Neural Networks
Ganie, Irfan
Jagannathan, S.
IFAC PAPERSONLINE, 2022, 55 (15): : 148 - 153
[48] Hierarchical Training of Deep Neural Networks Using Early Exiting
Sepehri, Yamin
Pad, Pedram
Yuzuguler, Ahmet Caner
Frossard, Pascal
Dunbar, L. Andrea
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 1 - 15
[49] Framework for the Training of Deep Neural Networks in TensorFlow Using Metaheuristics
Munoz-Ordonez, Julian
Cobos, Carlos
Mendoza, Martha
Herrera-Viedma, Enrique
Herrera, Francisco
Tabik, Siham
INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2018, PT I, 2018, 11314 : 801 - 811
[50] Training Deep Neural Networks Using Posit Number System
Lu, Jinming
Lu, Siyuan
Wang, Zhisheng
Fang, Chao
Lin, Jun
Wang, Zhongfeng
Du, Li
32ND IEEE INTERNATIONAL SYSTEM ON CHIP CONFERENCE (IEEE SOCC 2019), 2019, : 62 - 67

← 1 2 3 4 5 →