Application of the modified group delay function to speaker identification and discrimination

被引：0

作者：

Hegde, RM ^{[1
]}

Murthy, HA ^{[1
]}

Rao, GVR ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras, Tamil Nadu, India

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we explore new methods by which speakers can be identified and discriminated, using features derived from the fourier transform phase. The Modified Group Delay Feature(MODGDF) which is a parameterized form of the modified group delay function is used as a front end feature in this study. A Gaussian mixture model(GMM) based speaker identification system is built with the MODGDF as the front end feature. The system is tested on both clean (TIMIT) and noisy telephone(NTIMIT) speech. The results obtained are compared with traditional Mel frequency cepstral coefficients(MFCC) which is derived from the fourier transform magnitude. When both MFCC and MODGDF were combined, the performance improved by about 4% indicating that both phase and magnitude contain complementary information. In an earlier paper [1], it was shown that the MODGDF does possess phoneme specific characteristics. In this paper we show that the MODGDF has speaker specific properties. We also make an attempt to understand speaker discriminating characteristics of the MODGDF using the nonlinear mapping technique based on Sammon mapping [10] and find that the MODGDF empirically demonstrates a certain level of linear separability among speakers.

引用

页码：517 / +

页数：2

共 50 条

[21] Application of GMM in the speaker identification system
Zeng Chun
Li Zhong
2011 7TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING (WICOM), 2011,
[22] A NOVEL APPLICATION OF GROUP DELAY FUNCTION FOR IDENTIFYING TONIC IN CARNATIC MUSIC
Bellur, Ashwin
Murthy, Hema A.
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[23] Speaker Identification System Based in Verification Techniques with Bayesian Discrimination
Vera, Matias
Pelle, Patricia
Estienne, Claudio
Ferrer, Luciana
2015 XVI WORKSHOP ON INFORMATION PROCESSING AND CONTROL (RPIC), 2015,
[24] GROUP NONNEGATIVE MATRIX FACTORISATION WITH SPEAKER AND SESSION VARIABILITY COMPENSATION FOR SPEAKER IDENTIFICATION
Serizel, Romain
Essid, Slim
Richard, Gael
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5470 - 5474
[25] Combine multiple time-delay HMEs for speaker identification
Chen, K
Xie, DH
Chi, HS
ICNN - 1996 IEEE INTERNATIONAL CONFERENCE ON NEURAL NETWORKS, VOLS. 1-4, 1996, : 2015 - 2020
[26] On parametric representations of the modified group delay
Padmanabhan, R.
Murthy, Hema A.
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1185 - 1188
[27] Application of KPCA and PNN for robust speaker identification
Ren, Xue-Hui
Zhang, Ya-Fen
Xing, Yu-Juan
Li, Ming
CISP 2008: FIRST INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOL 4, PROCEEDINGS, 2008, : 533 - 536
[28] An application of a formula of Alberto Calderon to speaker identification
Daubechies, I
Maes, S
HARMONIC ANALYSIS AND PARTIAL DIFFERENTIAL EQUATIONS: ESSAYS IN HONOR OF ALBERTO P CALDERON, 1999, : 163 - 181
[29] An application of fuzzy entropy clustering in speaker identification
Tran, D
Wagner, M
PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 215 - 218
[30] An application of fuzzy entropy clustering in speaker identification
Tran, D
Wagner, M
PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 228 - 231

← 1 2 3 4 5 →