Enhancing Speaker Recognition Models with Noise-Resilient Feature Optimization Strategies

Cited by: 2
Authors
Chauhan, Neha [1]
Isshiki, Tsuyoshi [1]
Li, Dongju [1]
Affiliation
[1] Tokyo Inst Technol, Dept Informat & Commun Engn, Tokyo 1528550, Japan
Source
ACOUSTICS | 2024 / Vol. 6 / Issue 02
Keywords
speaker identification; speaker verification; feature-level fusion; dimension reduction; feature optimization; PCA;
DOI
10.3390/acoustics6020024
Chinese Library Classification
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper examines speaker recognition with a focus on three approaches: feature-level fusion; dimension reduction using principal component analysis (PCA) and independent component analysis (ICA); and feature optimization using a genetic algorithm (GA) and the marine predator algorithm (MPA). Experiments cover several speech datasets with varying noise levels and speaker counts. On the TIMIT babble noise dataset (120 speakers), feature fusion achieves a speaker identification accuracy of 92.7%, while various feature optimization techniques combined with K-nearest neighbor (KNN) and linear discriminant (LD) classifiers yield a speaker verification equal error rate (SV EER) of 0.7%. On the TIMIT babble noise dataset (630 speakers), a KNN classifier with feature optimization achieves a speaker identification accuracy of 93.5% and an SV EER of 0.13%. On the TIMIT white noise dataset (120 and 630 speakers), PCA dimension reduction and feature optimization (PCA-MPA) with KNN classifiers attain speaker identification accuracies of 93.3% and 83.5% and SV EER values of 0.58% and 0.13%, respectively. On the VoxCeleb1 dataset, PCA-MPA feature optimization with KNN classifiers achieves a speaker identification accuracy of 95.2% and an SV EER of 1.8%. These findings indicate that feature optimization strategies improve both computational speed and speaker recognition performance.
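The abstract reports speaker verification equal error rates (SV EER) without defining the metric. As a rough illustration only, the sketch below shows one common way to estimate an EER from verification scores: sweep a decision threshold and take the point where the false acceptance rate and false rejection rate are closest. The function name `equal_error_rate` and the example scores are hypothetical and are not taken from the paper, which may compute EER differently.

```python
def equal_error_rate(genuine, impostor):
    """Estimate the EER from two lists of similarity scores.

    genuine  -- scores for trials where the claimed speaker is correct
    impostor -- scores for trials where the claimed speaker is wrong
    A trial is accepted when its score is >= the threshold.
    """
    # Candidate thresholds: every distinct observed score.
    thresholds = sorted(set(genuine) | set(impostor))
    best_gap, best_eer = None, None
    for t in thresholds:
        # False rejection rate: genuine trials scored below the threshold.
        frr = sum(s < t for s in genuine) / len(genuine)
        # False acceptance rate: impostor trials scored at or above it.
        far = sum(s >= t for s in impostor) / len(impostor)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            # Report the midpoint of FAR and FRR at the closest crossing.
            best_gap, best_eer = gap, (far + frr) / 2
    return best_eer


# Made-up scores purely for illustration (assumption, not paper data).
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]
eer = equal_error_rate(genuine_scores, impostor_scores)
print(f"EER = {eer:.2%}")
```

With real systems the threshold sweep is usually done over many thousands of trials, so the crossing point is found far more precisely than in this toy example.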
Pages: 439-469
Page count: 31
Related papers
50 in total
  • [41] Beyond Granularity: Enhancing Continuous Sign Language Recognition with Granularity-Aware Feature Fusion and Attention Optimization
    Du, Yao
    Peng, Taiying
    Hu, Xiaohui
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [42] BrepMFR: Enhancing machining feature recognition in B-rep models through deep learning and domain adaptation
    Zhang, Shuming
    Guan, Zhidong
    Jiang, Hao
    Wang, Xiaodong
    Tan, Pingan
    COMPUTER AIDED GEOMETRIC DESIGN, 2024, 111
  • [43] Integrated optimization of underwater acoustic ship-radiated noise recognition based on two-dimensional feature fusion
    Ke, Xiaoquan
    Yuan, Fei
    Cheng, En
    APPLIED ACOUSTICS, 2020, 159
  • [44] A Low-Complexity Parabolic Lip Contour Model With Speaker Normalization for High-Level Feature Extraction in Noise-Robust Audiovisual Speech Recognition
    Borgstroem, Bengt Jonas
    Alwan, Abeer
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2008, 38 (06): 1273 - 1280
  • [45] Multi-Objective Optimization with Homotopy-Based Strategies for Enhanced Multimodal Automatic Target Recognition Models
    Abraham, Sophia
    Cruz, Steve
    You, Suya
    Hauenstein, Jonathan D.
    Scheirer, Walter J.
    AUTOMATIC TARGET RECOGNITION XXXIV, 2024, 13039
  • [46] Innovative multi-class segmentation for brain tumor MRI using noise diffusion probability models and enhancing tumor boundary recognition
    Liu, Zengxin
    Ma, Caiwen
    She, Wenji
    Xie, Meilin
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [47] Knowledge-based manufacturability assessment for optimization of additive manufacturing processes based on automated feature recognition from CAD models
    Stavropoulos, Panagiotis
    Tzimanis, Konstantinos
    Souflas, Thanassis
    Bikas, Harry
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (02): 993 - 1007
  • [48] Knowledge-based manufacturability assessment for optimization of additive manufacturing processes based on automated feature recognition from CAD models
    Panagiotis Stavropoulos
    Konstantinos Tzimanis
    Thanassis Souflas
    Harry Bikas
    The International Journal of Advanced Manufacturing Technology, 2022, 122 : 993 - 1007
  • [49] Enhancing Helmet Violation Detection and License Plate Recognition through Optimization of YOLOV8 Models with Edge Computing Integration
    Hoa Doan Nguyen Thanh
    Phu Nguyen Ngoc Thien
    Nghia Phan Duc
    Anh Nguyen Phan Tuan
    Nguyen Le Dinh
    PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024: 82 - 85
  • [50] Impact feature recognition method for non-stationary signals based on variational modal decomposition noise reduction and support vector machine optimized by whale optimization algorithm
    Xu, Fujing
    Hu, Linghua
    Jia, Tingwei
    Du, Shaocheng
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2021, 92 (12):