Stationary wavelet Filtering Cepstral coefficients (SWFCC) for robust speaker identification

被引:0
|
作者
Missaoui, Ibrahim [1 ,2 ]
Lachiri, Zied [1 ]
机构
[1] Signal, Images and Information Technologies Laboratory, LR-11-ES17, National Engineering School of Tunis (ENIT), University of Tunis El Manar, BP 37, le Belvédère, 1002, Tunis, Tunisia
[2] Higher Institute of Computer Science and Multimedia of Gabes, University of Gabes, Tunisia
关键词
Filter banks - Speech enhancement - Speech recognition - Wavelet analysis - Wavelet transforms;
D O I
10.1016/j.apacoust.2024.110435
中图分类号
学科分类号
摘要
Extracting robust effective speech features is one of the challenging topics in the speaker recognition field, especially in noisy conditions. It can substantially improve the robustness recognition accuracy of persons from their voice signals against such conditions. This paper proposes a new feature extraction approach called Stationary Wavelet Filtering Cepstral Coefficients (SWFCC) for noisy speaker recognition. The proposed approach incorporates a Stationary Wavelet Filterbank (SWF) and an Implicit Wiener Filtering (IWF) technique. The SWF is based on the stationary wavelet packet transform, which is a shift-invariant transform. The performance of the proposed SWFCC approach is evaluated on the TIMIT dataset in the presence of different types of environmental noise, which are taken from the Aurora dataset. Our experimental results using the Gaussian Mixture Model-Universal Background Model (GMM-UBM) as a classifier show that SWFCC outperforms various feature extraction techniques like MFCC, PNCC, and GFCC in terms of recognition accuracy. © 2024 Elsevier Ltd
引用
下载
收藏
相关论文
共 50 条
  • [21] A robust speaker identification system based on wavelet transform
    Hsieh, CT
    Wang, YC
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2001, E84D (07): : 839 - 846
  • [22] Speaker identification using cepstral analysis
    Nazar, MN
    ISCON 2002: IEEE STUDENTS CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2002, : 139 - 143
  • [23] Speaker Identification Using Linear Predictive Cepstral Coefficients And General Regression Neural Network
    Li, Penghua
    Hu, Fangchao
    Li, Yinguo
    Xu, Yang
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 4952 - 4956
  • [24] Wavelet Packet Based Mel Frequency Cepstral Features for Text Independent Speaker Identification
    Srivastava, Smriti
    Bhardwaj, Saurabh
    Bhandari, Abhishek
    Gupta, Krit
    Bahl, Hitesh
    Gupta, J. R. P.
    INTELLIGENT INFORMATICS, 2013, 182 : 237 - 247
  • [25] Power Normalized Gammachirp Cepstral (PNGC) coefficients-based approach for robust speaker recognition
    Zouhir, Youssef
    Zarka, Mohamed
    Supervision, Kais Ouni
    APPLIED ACOUSTICS, 2023, 205
  • [26] Wavelet Cepstral Coefficients for Electrical Appliances Identification using Hidden Markov Models
    Hacine-Gharbi, Abdenour
    Ravier, Philippe
    PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS (ICPRAM 2018), 2018, : 541 - 549
  • [27] Perceptual MVDR-based cepstral coefficients for speaker recognition
    Liang, Chunyan
    Zhang, Xiang
    Yang, Lin
    Zhang, Jianping
    Yan, Yonghong
    Shengxue Xuebao/Acta Acustica, 2012, 37 (06): : 673 - 678
  • [28] Mel Frequency Cepstral Coefficients (MFCC) Based Speaker Identification in Noisy Environment Using Wiener Filter
    Chauhan, Paresh M.
    Desai, Nikita P.
    2014 INTERNATIONAL CONFERENCE ON GREEN COMPUTING COMMUNICATION AND ELECTRICAL ENGINEERING (ICGCCEE), 2014,
  • [29] Robust speech features based on wavelet transform with application to speaker identification
    Hsieh, CT
    Lai, E
    Wang, YC
    IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 2002, 149 (02): : 108 - 114
  • [30] A robust wavelet-based text-independent speaker identification
    Phung Trung Nghia
    Pham Viet Binh
    Nguyen Huu Thai
    Nguyen Thanh Ha
    Kumsawat, Prayoth
    ICCIMA 2007: INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND MULTIMEDIA APPLICATIONS, VOL II, PROCEEDINGS, 2007, : 219 - 223