Speaker identification based on GMM with embedded AANN

被引:0
|
作者
Chen C.-B. [1 ]
Zhao L. [1 ]
机构
[1] School of Information Science and Engineering, Southeast University
关键词
Auto-Associate Neural Network (AANN); Embedded; Gaussian Mixed Model (GMM); Speaker identification;
D O I
10.3724/SP.J.1146.2008.00275
中图分类号
学科分类号
摘要
In this paper, a modified Gaussian Mixed Model (GMM) with an embedded Auto-Associate Neural Network (AANN) is proposed. It integrates the merits of GMM and AANN. GMM and AANN as a whole are trained by means of Maximum Likelihood (ML). In the process of training, the parameters of GMM and AANN are updated alternately. AANN reshapes the distribution of the data and improves the similarity of the data in one class. Experiments show that the proposed system improves accuracy rate against baseline GMM at all SNR, maximum to 19%.
引用
下载
收藏
页码:528 / 532
页数:4
相关论文
共 16 条
  • [1] Zhao L., Speech Signal Processing, pp. 236-253, (2003)
  • [2] Campbell J.P., Speaker recognition: A tutorial, Proceedings of the IEEE, 85, 9, pp. 1437-1462, (1997)
  • [3] Bimbot F., Bonastre J.F., Fredouille C., Et al., A tutorial on text-independent speaker verification, EURASIP Journal on Applied Signal Processing, 2004, 4, pp. 430-451, (2004)
  • [4] Reynolds D.A., Rose R.C., Robust text-independent speaker identification using Gaussian mixture models, IEEE Transactions on Speech Audio Processing, 3, 1, pp. 72-83, (1995)
  • [5] Reynolds D.A., Quatieri T., Dunn R., Speaker verification using adapted Gaussian mixture models, Digital Signal Processing, 10, 1, pp. 19-41, (2000)
  • [6] Kwon S., Narayanan S., Robust speaker identification based on selective use of feature vectors, Pattern Recognition Letters, 28, 1, pp. 85-89, (2007)
  • [7] Campbell W.M., Sturim D.E., Reynolds D.A., SVM based speaker verification using a GMM supervector kernel and NAP variability compensation, Proceedings of ICASSP, pp. 97-100, (2006)
  • [8] Yin S.-C., Rose R., Kenny P., A joint factor analysis approach to progressive model adaptation in text-independent speaker verification, IEEE Transactions on Audio, Speech and Language Processing, 15, 7, pp. 1999-2110, (2007)
  • [9] Mak M.W., Allen W.G., Sexton G.G., Speaker identification using multilayer perceptron and radial basis function networks, Neurocomputing, 6, 1, pp. 99-117, (1994)
  • [10] Bennani Y., Gallinari P., On the use of TDNN-extracted features information in talker identification, Proceedings of ICASSP, pp. 385-388, (1991)