Stationary wavelet Filtering Cepstral coefficients (SWFCC) for robust speaker identification

被引:0
|
作者
Missaoui, Ibrahim [1 ,2 ]
Lachiri, Zied [1 ]
机构
[1] Signal, Images and Information Technologies Laboratory, LR-11-ES17, National Engineering School of Tunis (ENIT), University of Tunis El Manar, BP 37, le Belvédère, 1002, Tunis, Tunisia
[2] Higher Institute of Computer Science and Multimedia of Gabes, University of Gabes, Tunisia
关键词
Filter banks - Speech enhancement - Speech recognition - Wavelet analysis - Wavelet transforms;
D O I
10.1016/j.apacoust.2024.110435
中图分类号
学科分类号
摘要
Extracting robust effective speech features is one of the challenging topics in the speaker recognition field, especially in noisy conditions. It can substantially improve the robustness recognition accuracy of persons from their voice signals against such conditions. This paper proposes a new feature extraction approach called Stationary Wavelet Filtering Cepstral Coefficients (SWFCC) for noisy speaker recognition. The proposed approach incorporates a Stationary Wavelet Filterbank (SWF) and an Implicit Wiener Filtering (IWF) technique. The SWF is based on the stationary wavelet packet transform, which is a shift-invariant transform. The performance of the proposed SWFCC approach is evaluated on the TIMIT dataset in the presence of different types of environmental noise, which are taken from the Aurora dataset. Our experimental results using the Gaussian Mixture Model-Universal Background Model (GMM-UBM) as a classifier show that SWFCC outperforms various feature extraction techniques like MFCC, PNCC, and GFCC in terms of recognition accuracy. © 2024 Elsevier Ltd
引用
下载
收藏
相关论文
共 50 条
  • [31] A Study on Speaker Identification Approach by Feature Matching Algorithm using Pitch and Mel Frequency Cepstral Coefficients
    Prasetio, Barlian Henryranu
    Sakurai, Keiko
    Tamura, Hiroki
    Tanno, Koichi
    ICAROB 2019: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2019, : 475 - 478
  • [32] Mean Hilbert envelope coefficients (MHEC) for robust speaker and language identification
    Sadjadi, Seyed Omid
    Hansen, John H. L.
    SPEECH COMMUNICATION, 2015, 72 : 138 - 148
  • [33] Speaker Identification using Warped MVDR Cepstral Features
    Woelfel, Matthias
    Yang, Qian
    Jin, Qin
    Schultz, Tanja
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 904 - +
  • [34] Perceptual MVDR-Based Cepstral Coefficients (PMCCs) for Speaker Recognition
    Liang, Chunyan
    Zhang, Xiang
    Yang, Lin
    Zhang, Jianping
    Yan, Yonghong
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 1386 - 1389
  • [35] Constant Q cepstral coefficients: A spoofing countermeasure for automatic speaker verification
    Todisco, Massimiliano
    Delgado, Hector
    Evans, Nicholas
    COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 516 - 535
  • [36] Wavelet Packet Sub-band Cepstral Coefficient for Speaker Verification
    Min, Hang
    Wei, Guangcun
    Xu, Yunfei
    Zhang, Yanna
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 1713 - 1717
  • [37] Perceptual MVDR-based cepstral coefficients(PMCCs)for speaker recognition
    LIANG Chunyan ZHANG Xiang YANG Lin ZHANG Jianping YAN Yonghong (Key Laboratory of Speech Acoustics and Content Understanding
    Chinese Journal of Acoustics, 2012, 31 (04) : 489 - 498
  • [38] Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition
    Jokic, Ivan D.
    Jokic, Stevan D.
    Delic, Vlado D.
    Peric, Zoran H.
    2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR), 2015, : 419 - 424
  • [39] Damped Oscillator Cepstral Coefficients for Robust Speech Recognition
    Mitra, Vikramjit
    Franco, Horacio
    Graciarena, Martin
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 886 - 890
  • [40] Daubechies Wavelet Cepstral Coefficients for Parkinson's Disease Detection
    Zayrit, Soumaya
    Belhoussine Drissi, Taoufiq
    Ammoumou, Abdelkrim
    Nsiri, Benayad
    COMPLEX SYSTEMS, 2020, 29 (03): : 729 - 739