Perceptual MVDR-based cepstral coefficients(PMCCs)for speaker recognition

被引:2
|
作者
LIANG Chunyan ZHANG Xiang YANG Lin ZHANG Jianping YAN Yonghong (Key Laboratory of Speech Acoustics and Content Understanding
机构
基金
中国国家自然科学基金;
关键词
MVDR; LP; Perceptual MVDR-based cepstral coefficients;
D O I
10.15949/j.cnki.0217-9776.2012.04.004
中图分类号
TN912.34 [语音识别与设备];
学科分类号
0711 ;
摘要
A feature extraction technique named perceptual MVDR-based cepstral coefficients (PMCCs) was introduced into speaker recognition.PMCCs are extracted and modeled using Gaussian Mixture Models(GMMs) for speaker recognition.In order to compensate for speaker and channel variability effects,joint factor analysis(JFA) is used.The experiments are carried out on the core conditions of NIST 2008 speaker recognition evaluation data.The experimental results show that the systems based on PMCCs can achieve comparable performance to those based on the conventional MFCCs.Besides,the fusion of the two kinds of systems can make significant performance improvement compared to the MFCCs system alone,reducing equal error rate(EER) by the factor between 7.6%and 30.5%as well as minimum detect cost function (minDCF) by the factor between 3.2%and 21.2%on different test sets.The results indicate that PMCCs can be effectively applied in speaker recognition and they are complementary with MFCCs to some extent.
引用
收藏
页码:489 / 498
页数:10
相关论文
共 50 条
  • [31] A new perceptually motivated MVDR-based acoustic front-end (PMVDR) for robust automatic speech recognition
    Yapanel, Umit H.
    Hansen, John H. L.
    [J]. SPEECH COMMUNICATION, 2008, 50 (02) : 142 - 152
  • [32] WAVELET BASED CEPSTRAL COEFFICIENTS FOR NEURAL NETWORK SPEECH RECOGNITION
    Adam, T. B.
    Salam, M. S.
    Gunawan, T. S.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS (IEEE ICSIPA 2013), 2013, : 447 - 451
  • [33] Speaker Dependent Coefficients for Speaker Recognition
    Orsag, Filip
    [J]. INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2010, 4 (01): : 31 - 47
  • [34] Whispered speech recognition based on gammatone filterbank cepstral coefficients
    B. Marković
    J. Galić
    Ð. Grozdić
    S. T. Jovičić
    M. Mijić
    [J]. Journal of Communications Technology and Electronics, 2017, 62 : 1255 - 1261
  • [35] One Solution of Extension of Mel-Frequency Cepstral Coefficients Feature Vector for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    [J]. INFORMATION TECHNOLOGY AND CONTROL, 2020, 49 (02): : 224 - 236
  • [36] The use of Locally Normalized Cepstral Coefficients (LNCC) to improve speaker recognition accuracy in highly reverberant rooms
    Poblete, Victor
    Pablo Escudero, Juan
    Fredes, Josue
    Novoa, Jose
    Stern, Richard M.
    King, Simon
    Becerra Yoma, Nestor
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2373 - 2377
  • [37] SPEAKER RECOGNITION USING SYLLABLE-BASED CONSTRAINTS FOR CEPSTRAL FRAME SELECTION
    Bocklet, Tobias
    Shriberg, Elizabeth
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4525 - +
  • [38] Reducing the environmental sensitivity of cepstral features for speaker recognition
    Openshaw, JP
    Mason, JS
    [J]. ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 721 - 724
  • [39] Mel Frequency Cepstral Coefficients Based Similar Albanian Phonemes Recognition
    Karahoda, Bertan
    Pireva, Krenare
    Imran, Ali Shariq
    [J]. HUMAN INTERFACE AND THE MANAGEMENT OF INFORMATION: INFORMATION, DESIGN AND INTERACTION, PT I, 2016, 9734 : 491 - 500
  • [40] MVDR-Based Coherence Weighting for High-Frame-Rate Adaptive Imaging
    Wang, Shun-Li
    Li, Pai-Chi
    [J]. IEEE TRANSACTIONS ON ULTRASONICS FERROELECTRICS AND FREQUENCY CONTROL, 2009, 56 (10) : 2097 - 2110