Group delay features for speaker recognition

Cited by: 0
Authors
Thiruvaran, Tharmarajah [1]
Ambikairajah, Eliathamby [1]
Epps, Julien [1]
Affiliations
[1] Univ New S Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
Keywords
speaker recognition; group delay
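DOI
Not available
Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract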
Group delay is proposed as an effective means of representing spectral phase information as a feature for speaker recognition. Robust group delay features are difficult to obtain, because spikes in the group delay function mask its fine structure. In this paper, two group delay based features are proposed, each reducing the effect of the spikes in a different way. The first applies log compression to address the masking effect of the spikes; the second uses a sub-band based approach, in which masking is confined to the bands that contain the spikes. The purpose of this paper is to introduce different types of group delay feature extraction methods. The two features are evaluated on the cellular NIST 2001 database.
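The abstract does not give formulas, so the following is only a minimal sketch of the ideas it names, not the authors' implementation. It computes the group delay of one frame using the standard identity tau = (X_R Y_R + X_I Y_I) / |X|^2, where X is the DFT of x[n] and Y is the DFT of n x[n]. The functions `log_compress` and `subband_normalize`, the sign-preserving log(1 + |tau|) form, the per-band peak scaling, and the `eps` floor are all assumptions introduced here for illustration.

```python
import cmath
import math


def dft(x):
    """Naive O(N^2) discrete Fourier transform of a sequence."""
    n_pts = len(x)
    return [
        sum(x[n] * cmath.exp(-2j * math.pi * k * n / n_pts) for n in range(n_pts))
        for k in range(n_pts)
    ]


def group_delay(frame, eps=1e-12):
    """Group delay per DFT bin, in samples.

    X = DFT(x[n]), Y = DFT(n * x[n]); tau = (X_R*Y_R + X_I*Y_I) / |X|^2.
    Spikes appear where |X|^2 is close to zero; eps (an assumption)
    only guards against division by exactly zero.
    """
    X = dft(frame)
    Y = dft([n * v for n, v in enumerate(frame)])
    return [
        (xk.real * yk.real + xk.imag * yk.imag) / max(abs(xk) ** 2, eps)
        for xk, yk in zip(X, Y)
    ]


def log_compress(tau):
    """Hypothetical sign-preserving log compression: sign(t) * log(1 + |t|)."""
    return [math.copysign(math.log1p(abs(t)), t) for t in tau]


def subband_normalize(tau, n_bands):
    """Hypothetical sub-band scaling: divide each contiguous band by its own
    peak magnitude, so a spike only dominates the band that contains it."""
    size = math.ceil(len(tau) / n_bands)
    out = []
    for start in range(0, len(tau), size):
        band = tau[start:start + size]
        peak = max(abs(t) for t in band) or 1.0  # avoid dividing by zero
        out.extend(t / peak for t in band)
    return out


# A unit impulse delayed by 3 samples has a constant group delay of 3.
tau = group_delay([0, 0, 0, 1, 0, 0, 0, 0])
compressed = log_compress(tau)
banded = subband_normalize([1.0, 100.0, 2.0, 4.0], n_bands=2)
```

In the last line, the spike of 100 rescales only the first band, so the second band keeps its relative structure (0.5 and 1.0). With global scaling, both values would have shrunk to a few hundredths.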
Pages: 1113-1117
Page count: 5