Stationary wavelet Filtering Cepstral coefficients (SWFCC) for robust speaker identification

被引:0
|
作者
Missaoui, Ibrahim [1 ,2 ]
Lachiri, Zied [1 ]
机构
[1] Signal, Images and Information Technologies Laboratory, LR-11-ES17, National Engineering School of Tunis (ENIT), University of Tunis El Manar, BP 37, le Belvédère, 1002, Tunis, Tunisia
[2] Higher Institute of Computer Science and Multimedia of Gabes, University of Gabes, Tunisia
关键词
Filter banks - Speech enhancement - Speech recognition - Wavelet analysis - Wavelet transforms;
D O I
10.1016/j.apacoust.2024.110435
中图分类号
学科分类号
摘要
Extracting robust effective speech features is one of the challenging topics in the speaker recognition field, especially in noisy conditions. It can substantially improve the robustness recognition accuracy of persons from their voice signals against such conditions. This paper proposes a new feature extraction approach called Stationary Wavelet Filtering Cepstral Coefficients (SWFCC) for noisy speaker recognition. The proposed approach incorporates a Stationary Wavelet Filterbank (SWF) and an Implicit Wiener Filtering (IWF) technique. The SWF is based on the stationary wavelet packet transform, which is a shift-invariant transform. The performance of the proposed SWFCC approach is evaluated on the TIMIT dataset in the presence of different types of environmental noise, which are taken from the Aurora dataset. Our experimental results using the Gaussian Mixture Model-Universal Background Model (GMM-UBM) as a classifier show that SWFCC outperforms various feature extraction techniques like MFCC, PNCC, and GFCC in terms of recognition accuracy. © 2024 Elsevier Ltd
引用
下载
收藏
相关论文
共 50 条
  • [1] Speaker identification using Kalman cepstral coefficients
    Svenda, Z
    Radová, V
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2000, 1902 : 295 - 300
  • [2] Cascaded Feedforward Neural Networks for speaker identification using Perceptual Wavelet based Cepstral Coefficients
    Renisha, G.
    Jayasree, T.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (01) : 1141 - 1153
  • [3] Gammatone Wavelet Cepstral Coefficients for Robust Speech Recognition
    Adiga, Aniruddha
    Magimai-Doss, Mathew
    Seelamantula, Chandra Sekhar
    2013 IEEE INTERNATIONAL CONFERENCE OF IEEE REGION 10 (TENCON), 2013,
  • [4] Cancelable speaker identification based on cepstral coefficients and comb filters
    Monir M.
    Kareem M.
    El-Dolil S.M.
    Saleeb A.
    El-Fishawy A.S.
    Nassar M.A.-E.
    Zein Eldin M.A.
    Abd El-Samie F.E.
    Int J Speech Technol, 2 (471-492): : 471 - 492
  • [5] Phase Based Mel Frequency Cepstral Coefficients for Speaker Identification
    Srivastava, Sumit
    Chandra, Mahesh
    Sahoo, G.
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 3, INDIA 2016, 2016, 435 : 309 - 316
  • [6] Bionic Cepstral coefficients (BCC): A new auditory feature extraction to noise-robust speaker identification
    Zouhir, Youssef
    Zarka, Mohamed
    Ouni, Kais
    APPLIED ACOUSTICS, 2024, 221
  • [7] Modified Mel-frequency Cepstral Coefficients (MMFCC) in Robust Text-dependent Speaker Identification
    Islam, Md. Atiqul
    2017 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2017, : 505 - 509
  • [8] A Wavelet Packet and Mel-Frequency Cepstral Coefficients-Based Feature Extraction Method for Speaker Identification
    Turner, Claude
    Joseph, Anthony
    COMPLEX ADAPTIVE SYSTEMS, 2015, 2015, 61 : 416 - 421
  • [9] Gammatone Frequency Cepstral Coefficients for Speaker Identification over VoIP Networks
    Bouziane, Ayoub
    Kharroubi, Jamal
    Zarghili, Arsalane
    2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,
  • [10] A late fusion deep neural network for robust speaker identification using raw waveforms and gammatone cepstral coefficients
    Salvati, Daniele
    Drioli, Carlo
    Foresti, Gian Luca
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 222