Enhancing Speaker Recognition Models with Noise-Resilient Feature Optimization Strategies

被引:2
|
作者
Chauhan, Neha [1 ]
Isshiki, Tsuyoshi [1 ]
Li, Dongju [1 ]
机构
[1] Tokyo Inst Technol, Dept Informat & Commun Engn, Tokyo 1528550, Japan
来源
ACOUSTICS | 2024年 / 6卷 / 02期
关键词
speaker identification; speaker verification; feature-level fusion; dimension reduction; feature optimization; PCA;
D O I
10.3390/acoustics6020024
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper delves into an in-depth exploration of speaker recognition methodologies, with a primary focus on three pivotal approaches: feature-level fusion, dimension reduction employing principal component analysis (PCA) and independent component analysis (ICA), and feature optimization through a genetic algorithm (GA) and the marine predator algorithm (MPA). This study conducts comprehensive experiments across diverse speech datasets characterized by varying noise levels and speaker counts. Impressively, the research yields exceptional results across different datasets and classifiers. For instance, on the TIMIT babble noise dataset (120 speakers), feature fusion achieves a remarkable speaker identification accuracy of 92.7%, while various feature optimization techniques combined with K nearest neighbor (KNN) and linear discriminant (LD) classifiers result in a speaker verification equal error rate (SV EER) of 0.7%. Notably, this study achieves a speaker identification accuracy of 93.5% and SV EER of 0.13% on the TIMIT babble noise dataset (630 speakers) using a KNN classifier with feature optimization. On the TIMIT white noise dataset (120 and 630 speakers), speaker identification accuracies of 93.3% and 83.5%, along with SV EER values of 0.58% and 0.13%, respectively, were attained utilizing PCA dimension reduction and feature optimization techniques (PCA-MPA) with KNN classifiers. Furthermore, on the voxceleb1 dataset, PCA-MPA feature optimization with KNN classifiers achieves a speaker identification accuracy of 95.2% and an SV EER of 1.8%. These findings underscore the significant enhancement in computational speed and speaker recognition performance facilitated by feature optimization strategies.
引用
收藏
页码:439 / 469
页数:31
相关论文
共 50 条
  • [1] Exploiting Multilabel Information for Noise-Resilient Feature Selection
    Jian, Ling
    Li, Jundong
    Liu, Huan
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2018, 9 (05)
  • [2] Large-Scale Noise-Resilient Evolution-Strategies
    Krause, Oswin
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 682 - 690
  • [3] Semi-supervised noise-resilient anomaly detection with feature autoencoder
    Zhu, Tianyi
    Liu, Lina
    Sun, Yibo
    Lu, Zhi
    Zhang, Yuanlong
    Xu, Chao
    Chen, Jun
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [4] Noise-resilient variational hybrid quantum-classical optimization
    Gentini, Laura
    Cuccoli, Alessandro
    Pirandola, Stefano
    Verrucchi, Paola
    Banchi, Leonardo
    PHYSICAL REVIEW A, 2020, 102 (05)
  • [5] Noise-resilient feature selection for accelerometer-based guyed tower monitoring
    de Oliveira, Juliane Regina
    Jimenez, German Efrain Casteneda
    Ferreira, Janito Vaqueiro
    de Lima, Eduardo Rodrigues
    de Almeida, Larissa Medeiros
    Wanner, Lucas
    INTERNET OF THINGS, 2025, 31
  • [6] NRGAN: A Noise-resilient GAN with adaptive feature modulation for SAR image segmentation
    Lian, Shuo
    Fan, Jianchao
    Wang, Jun
    PATTERN RECOGNITION, 2025, 164
  • [7] IMPROVED NOISE-RESILIENT ISOLATED WORDS SPEECH RECOGNITION USING PIECEWISE DIFFERENTIATION
    Al-Anzi, Fawaz S.
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2022, 30 (08)
  • [8] Dynamic adaptive quantum approximate optimization algorithm for shallow, noise-resilient circuits
    Yanakiev, Nikola
    Mertig, Normann
    Long, Christopher K.
    Arvidsson-Shukur, David R. M.
    PHYSICAL REVIEW A, 2024, 109 (03)
  • [9] The noise-resilient brain: Resting-state oscillatory activity predicts words-in-noise recognition
    Houweling, Thomas
    Becker, Robert
    Hervais-Adelman, Alexis
    BRAIN AND LANGUAGE, 2020, 202
  • [10] Optimizing telescoped heterogeneous catalysis with noise-resilient multi-objective Bayesian optimization
    Luo, Guihua
    Yang, Xilin
    Su, Weike
    Qi, Tingting
    Xu, Qilin
    Su, An
    CHEMICAL ENGINEERING SCIENCE, 2024, 298