Application of the modified group delay function to speaker identification and discrimination

被引:0
|
作者
Hegde, RM [1 ]
Murthy, HA [1 ]
Rao, GVR [1 ]
机构
[1] Indian Inst Technol, Dept Comp Sci & Engn, Madras, Tamil Nadu, India
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we explore new methods by which speakers can be identified and discriminated, using features derived from the fourier transform phase. The Modified Group Delay Feature(MODGDF) which is a parameterized form of the modified group delay function is used as a front end feature in this study. A Gaussian mixture model(GMM) based speaker identification system is built with the MODGDF as the front end feature. The system is tested on both clean (TIMIT) and noisy telephone(NTIMIT) speech. The results obtained are compared with traditional Mel frequency cepstral coefficients(MFCC) which is derived from the fourier transform magnitude. When both MFCC and MODGDF were combined, the performance improved by about 4% indicating that both phase and magnitude contain complementary information. In an earlier paper [1], it was shown that the MODGDF does possess phoneme specific characteristics. In this paper we show that the MODGDF has speaker specific properties. We also make an attempt to understand speaker discriminating characteristics of the MODGDF using the nonlinear mapping technique based on Sammon mapping [10] and find that the MODGDF empirically demonstrates a certain level of linear separability among speakers.
引用
收藏
页码:517 / +
页数:2
相关论文
共 50 条
  • [31] Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing
    Rajesh M. Hegde
    Hema A. Murthy
    V. R. R. Gadde
    EURASIP Journal on Audio, Speech, and Music Processing, 2007
  • [32] Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing
    Hegde, Rajesh M.
    Murthy, Hema A.
    Gadde, V. R. R.
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2007, 2007 (1)
  • [33] Speech processing using joint features derived from the modified group delay function
    Hegde, RM
    Murthy, HA
    Rao, GVR
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 541 - 544
  • [34] Small group speaker identification with common password phrases
    Rosenberg, AE
    Siohan, O
    Parthasarathy, S
    SPEECH COMMUNICATION, 2000, 31 (2-3) : 131 - 140
  • [35] Parameter discrimination analysis in speaker identification using self organizing map
    Pan, Y
    Hu, QX
    Wu, WH
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 273 - 278
  • [36] A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR
    Morrone, Giovanni
    Zovato, Enrico
    Brugnara, Fabio
    Sartori, Enrico
    Badino, Leonardo
    INTERSPEECH 2024, 2024, : 3652 - 3653
  • [37] Automatic Speaker Localization based on Speaker Identification -A Smart Room Application-
    Ouamour, Siham
    Sayoud, Halim
    2013 FOURTH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY AND ACCESSIBILITY (ICTA), 2013,
  • [38] Monaural Speaker Segregation Using Group Delay Spectral Matrix Factorization
    Nathwani, Karan
    Kumar, Anurag
    Hegde, Rajesh M.
    2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2014,
  • [39] The Use of Group Delay Features of Linear Prediction Model for Speaker Recognition
    Bastys, Algirdas
    Kisel, Andrej
    Salna, Bernardas
    INFORMATICA, 2010, 21 (01) : 1 - 12
  • [40] Speaker discrimination as a function of vowel realization: does focus affect perception?
    Heeren, Willemijn
    Voeten, Cesko
    Marks, Tessi
    DUTCH JOURNAL OF APPLIED LINGUISTICS, 2022, 11