Segment unit shuffling layer in deep neural networks for text-independent speaker verification

Cited by: 0

Authors:

Heo, Jungwoo [1]

Shim, Hye-jin [1]

Kim, Ju-ho [1]

Yu, Ha-Jin [1]

Affiliation:

[1] Univ Seoul, Coll Engn, Sch Comp Sci, 163 Siripdae Ro, Seoul 02504, South Korea

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA | 2021 / Vol. 40 / No. 2

Keywords:

Text-independent speaker verification; Deep neural network; Speaker embedding; Shuffling generalization;

DOI:

10.7776/ASK.2021.40.2.148

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Text-independent speaker verification requires speaker embeddings that do not depend on the spoken text in order to generalize well. However, deep neural networks, which depend on their training data, can overfit to text information instead of learning speaker information when they are trained repeatedly on identical time series. To prevent this overfitting, we propose a segment unit shuffling layer that divides the input layer or a hidden layer into segments along the time axis and rearranges them, thereby mixing the time-series information. Because the layer can be applied to hidden layers as well as to the input layer, it can serve as a generalization technique in the hidden layers, which is known to be more effective than generalization at the input layer, and it can be used together with data augmentation. In addition, the degree of distortion can be controlled by adjusting the segment unit size. With the proposed segment unit shuffling layer, text-independent speaker verification performance improves over the baseline.
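The divide-and-rearrange step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `segment_shuffle`, the uniform random permutation of segments, and the handling of a shorter trailing segment are all assumptions made for illustration, and a sequence of frames is represented as a plain Python list ordered along the time axis.

```python
import random
from typing import List, Optional, Sequence, TypeVar

Frame = TypeVar("Frame")


def segment_shuffle(
    frames: Sequence[Frame],
    segment_size: int,
    rng: Optional[random.Random] = None,
) -> List[Frame]:
    """Hypothetical helper: split `frames` into consecutive segments of
    `segment_size` frames along the time axis and return the segments in
    a random order. Frames inside a segment keep their original order."""
    if segment_size <= 0:
        raise ValueError("segment_size must be positive")
    if rng is None:
        rng = random.Random()

    # Cut the time axis into consecutive segments. Assumption: a shorter
    # trailing segment is kept and shuffled like any other segment.
    segments = [
        list(frames[start:start + segment_size])
        for start in range(0, len(frames), segment_size)
    ]

    # Rearrange the segments (assumed: uniform random permutation).
    rng.shuffle(segments)

    # Concatenate the segments back into one sequence.
    return [frame for segment in segments for frame in segment]


# Usage: ten frames labelled by their time index, shuffled in units of three.
frames = list(range(10))
shuffled = segment_shuffle(frames, segment_size=3, rng=random.Random(0))
```

In this sketch, a larger `segment_size` leaves longer stretches of the sequence intact, and a `segment_size` at least as long as the sequence returns it unchanged, which is one way to read the abstract's remark that the degree of distortion depends on the segment unit size.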


Pages: 148-154

Page count: 7
