Application of the modified group delay function to speaker identification and discrimination

被引：0

作者：

Hegde, RM ^{[1
]}

Murthy, HA ^{[1
]}

Rao, GVR ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras, Tamil Nadu, India

来源：

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING | 2004年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we explore new methods by which speakers can be identified and discriminated, using features derived from the fourier transform phase. The Modified Group Delay Feature(MODGDF) which is a parameterized form of the modified group delay function is used as a front end feature in this study. A Gaussian mixture model(GMM) based speaker identification system is built with the MODGDF as the front end feature. The system is tested on both clean (TIMIT) and noisy telephone(NTIMIT) speech. The results obtained are compared with traditional Mel frequency cepstral coefficients(MFCC) which is derived from the fourier transform magnitude. When both MFCC and MODGDF were combined, the performance improved by about 4% indicating that both phase and magnitude contain complementary information. In an earlier paper [1], it was shown that the MODGDF does possess phoneme specific characteristics. In this paper we show that the MODGDF has speaker specific properties. We also make an attempt to understand speaker discriminating characteristics of the MODGDF using the nonlinear mapping technique based on Sammon mapping [10] and find that the MODGDF empirically demonstrates a certain level of linear separability among speakers.

引用

页码：517 / +

页数：2

共 50 条

[1] Automatic language identification and discrimination using the modified group delay feature
Hegde, RM
Murthy, HA
2005 INTERNATIONAL CONFERENCE ON INTELLIGENT SENSING AND INFORMATION PROCESSING, PROCEEDINGS, 2005, : 395 - 399
[2] The modified group delay function and its application to phoneme recognition
Murthy, HA
Gadde, V
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 68 - 71
[3] A modified group vector quantization algorithm for speaker identification
Abu El-Yazeed, MF
Kader, NSA
El-Henawy, MM
PROCEEDINGS OF THE 46TH IEEE INTERNATIONAL MIDWEST SYMPOSIUM ON CIRCUITS & SYSTEMS, VOLS 1-3, 2003, : 629 - 632
[4] Group Delay Function For Improved Gender Identification
Lee, Kye-Hwan
Kang, Sang-Ick
Song, Ji-Hyun
Chang, Joon-Hyuk
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1513 - 1516
[5] Cluster and intrinsic dimensionality analysis of the modified group delay feature for speaker classification
Hegde, RM
Murthy, HA
NEURAL INFORMATION PROCESSING, 2004, 3316 : 1172 - 1178
[6] Group Delay Functions for Speaker Diarization
Yadav, Mohit
Sao, Anil Kumar
Dileep, A. D.
Rajan, Padmanabhan
2016 TWENTY SECOND NATIONAL CONFERENCE ON COMMUNICATION (NCC), 2016,
[7] Group delay features for speaker recognition
Thiruvaran, Tharmarajah
Ambikairajah, Eliathamby
Epps, Julien
2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1113 - 1117
[8] Modified group delay feature based total variability space modelling for speaker recognition
Madikeri S.R.
Talambedu A.
Murthy H.A.
International Journal of Speech Technology, 2015, 18 (1) : 17 - 23
[9] A modified speaker clustering method for efficient speaker identification
Yan, JiaChang
Wang, Lei
2014 SEVENTH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID 2014), VOL 2, 2014,
[10] An improved MMSE estimator based modified group delay spectrum for Forensic Automatic Speaker Recognition
Djeghiour, Salim
Guerti, Mhania
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2021, 24 (03) : 687 - 699

← 1 2 3 4 5 →