Speaker identification based on GMM with embedded AANN

被引：0

作者：

Chen C.-B. ^{[1
]}

Zhao L. ^{[1
]}

机构：

[1] School of Information Science and Engineering, Southeast University

来源：

Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology | 2010年 / 32卷 / 03期

关键词：

Auto-Associate Neural Network (AANN); Embedded; Gaussian Mixed Model (GMM); Speaker identification;

D O I：

10.3724/SP.J.1146.2008.00275

中图分类号：

学科分类号：

摘要：

In this paper, a modified Gaussian Mixed Model (GMM) with an embedded Auto-Associate Neural Network (AANN) is proposed. It integrates the merits of GMM and AANN. GMM and AANN as a whole are trained by means of Maximum Likelihood (ML). In the process of training, the parameters of GMM and AANN are updated alternately. AANN reshapes the distribution of the data and improves the similarity of the data in one class. Experiments show that the proposed system improves accuracy rate against baseline GMM at all SNR, maximum to 19%.

引用

下载

页码：528 / 532

页数：4

共 16 条

[1] Zhao L., Speech Signal Processing, pp. 236-253, (2003)
[2] Campbell J.P., Speaker recognition: A tutorial, Proceedings of the IEEE, 85, 9, pp. 1437-1462, (1997)
[3] Bimbot F., Bonastre J.F., Fredouille C., Et al., A tutorial on text-independent speaker verification, EURASIP Journal on Applied Signal Processing, 2004, 4, pp. 430-451, (2004)
[4] Reynolds D.A., Rose R.C., Robust text-independent speaker identification using Gaussian mixture models, IEEE Transactions on Speech Audio Processing, 3, 1, pp. 72-83, (1995)
[5] Reynolds D.A., Quatieri T., Dunn R., Speaker verification using adapted Gaussian mixture models, Digital Signal Processing, 10, 1, pp. 19-41, (2000)
[6] Kwon S., Narayanan S., Robust speaker identification based on selective use of feature vectors, Pattern Recognition Letters, 28, 1, pp. 85-89, (2007)
[7] Campbell W.M., Sturim D.E., Reynolds D.A., SVM based speaker verification using a GMM supervector kernel and NAP variability compensation, Proceedings of ICASSP, pp. 97-100, (2006)
[8] Yin S.-C., Rose R., Kenny P., A joint factor analysis approach to progressive model adaptation in text-independent speaker verification, IEEE Transactions on Audio, Speech and Language Processing, 15, 7, pp. 1999-2110, (2007)
[9] Mak M.W., Allen W.G., Sexton G.G., Speaker identification using multilayer perceptron and radial basis function networks, Neurocomputing, 6, 1, pp. 99-117, (1994)
[10] Bennani Y., Gallinari P., On the use of TDNN-extracted features information in talker identification, Proceedings of ICASSP, pp. 385-388, (1991)

← 1 2 →