Stationary Wavelet Filtering Cepstral Coefficients (SWFCC) for robust speaker identification

Cited by: 0

Authors:

Missaoui, Ibrahim ^{[1,2]}

Lachiri, Zied ^{[1]}

Affiliations:

[1] Signal, Images and Information Technologies Laboratory, LR-11-ES17, National Engineering School of Tunis (ENIT), University of Tunis El Manar, BP 37, le Belvédère, 1002, Tunis, Tunisia

[2] Higher Institute of Computer Science and Multimedia of Gabes, University of Gabes, Tunisia

Source:

Applied Acoustics | 2025 / Vol. 231

Keywords:

Filter banks - Speech enhancement - Speech recognition - Wavelet analysis - Wavelet transforms

DOI:

10.1016/j.apacoust.2024.110435

Chinese Library Classification (CLC) number:

Subject classification code:

Abstract:

Extracting robust and effective speech features is one of the challenging problems in the speaker recognition field, especially in noisy conditions. Such features can substantially improve the accuracy and robustness of recognizing persons from their voice signals under these conditions. This paper proposes a new feature extraction approach called Stationary Wavelet Filtering Cepstral Coefficients (SWFCC) for noisy speaker recognition. The proposed approach incorporates a Stationary Wavelet Filterbank (SWF) and an Implicit Wiener Filtering (IWF) technique. The SWF is based on the stationary wavelet packet transform, which is a shift-invariant transform. The performance of the proposed SWFCC approach is evaluated on the TIMIT dataset in the presence of different types of environmental noise taken from the Aurora dataset. Experimental results using the Gaussian Mixture Model-Universal Background Model (GMM-UBM) as a classifier show that SWFCC outperforms several feature extraction techniques, such as MFCC, PNCC, and GFCC, in terms of recognition accuracy. © 2024 Elsevier Ltd
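The shift-invariance property mentioned in the abstract can be illustrated with a small sketch. This is not the authors' SWF: it is a plain undecimated (à trous) Haar transform with circular boundaries, written in NumPy, and it decomposes only the approximation branch rather than a full wavelet packet tree. The function names `haar_swt_level` and `haar_swt`, the Haar filter, and the test signal are all assumptions made for illustration.

```python
import numpy as np

def haar_swt_level(x, level):
    """One level of an undecimated Haar transform (circular boundaries)."""
    step = 2 ** (level - 1)        # filter taps are spaced 2^(level-1) samples apart
    delayed = np.roll(x, step)     # x[n - step], wrapping around the ends
    approx = (x + delayed) / np.sqrt(2.0)
    detail = (x - delayed) / np.sqrt(2.0)
    return approx, detail

def haar_swt(x, levels):
    """Multi-level transform; no downsampling, so every band keeps len(x) samples."""
    approx = np.asarray(x, dtype=float)
    details = []
    for level in range(1, levels + 1):
        approx, detail = haar_swt_level(approx, level)
        details.append(detail)
    return approx, details

# Illustrative check: circularly shifting the input shifts every band by the same amount.
rng = np.random.default_rng(0)
signal = rng.standard_normal(64)
shift = 3

a, d = haar_swt(signal, 3)
a_s, d_s = haar_swt(np.roll(signal, shift), 3)

shift_invariant = bool(
    np.allclose(a_s, np.roll(a, shift))
    and all(np.allclose(ds, np.roll(dd, shift)) for ds, dd in zip(d_s, d))
)
print(shift_invariant)
```

Because no band is downsampled, a delayed input yields the same coefficients delayed by the same amount, which is the property a decimated wavelet transform lacks.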
