Speaker recognition using PCA-based feature transformation

被引:12
|
作者
Ahmed, Ahmed Isam [1 ]
Chiverton, John P. [1 ]
Ndzi, David L. [2 ]
Becerra, Victor M. [1 ]
机构
[1] Univ Portsmouth, Sch Energy & Elect Engn, Portsmouth PO1 3DJ, Hants, England
[2] Univ West Scotland, Sch Comp Engn & Phys Sci, Paisley PA1 2BE, Renfrew, Scotland
关键词
Weighted principal component analysis; Feature fusion; I-vector system; FRONT-END; IDENTIFICATION; MFCC; MODEL;
D O I
10.1016/j.specom.2019.04.001
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper introduces a Weighted-Correlation Principal Component Analysis (WCR-PCA) for efficient transformation of speech features in speaker recognition. A Recurrent Neural Network (RNN) technique is also introduced to perform the weighted PCA. The weights are taken as the log-likelihood values from a fitted Single Gaussian-Background Model (SG-BM). For speech features, we show that there are large differences between feature variances which makes covariance based PCA less optimal. A comparative study of the performance of speaker recognition is presented using weighted and unweighted correlation and covariance based PCA. Extensions to improve the extraction of MFCC and LPCC features of speech are also proposed. These are Odd Even filter banks MFCC (OE-MFCC) and Multitaper-Fitted LPCC. The methodologies are evaluated for the i-vector speaker recognition system. A subset of the 2010 NIST speaker recognition evaluation set is used in the performance testing in addition to evaluations on the VoxCeleb1 dataset. A relative improvement of 44% in terms of EER is found in the system performance using the NIST data and 18% using the VoxCeleb1 dataset.
引用
收藏
页码:33 / 46
页数:14
相关论文
共 50 条
  • [1] A PCA-based MFDWC Feature Parameter for Speaker Verification System
    Hahm, Seong-Jun
    Jung, Ho-Youl
    Chung, Hyun-Yeol
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (01): : 36 - 42
  • [2] PCA-based Feature Extraction for Phonotactic Language Recognition
    Mikolov, Tomas
    Plchot, Oldrich
    Glembek, Ondrej
    Matejka, Pavel
    Burget, Lukas
    Cernocky, Jan Honza
    [J]. ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010, : 251 - 255
  • [3] PCA-based feature transformation for classification: Issues in medical diagnostics
    Pechenizkiy, M
    Tsymbal, A
    Puuronen, S
    [J]. 17TH IEEE SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, PROCEEDINGS, 2004, : 535 - 540
  • [4] PCA-based feature extraction using class information
    Park, MS
    Na, JH
    Choi, JY
    [J]. INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 341 - 345
  • [5] Exploring Dataset Similarities using PCA-based Feature Selection
    Siegert, Ingo
    Boeck, Ronald
    Wendemuth, Andreas
    Vlasenko, Bogdan
    [J]. 2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 387 - 393
  • [6] Using MAP Estimation of Feature Transformation for Speaker Recognition
    Zhu, Donglai
    Ma, Bin
    Li, Haizhou
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 849 - 852
  • [7] PCA-based recognition for efficient inpainting
    Korah, T
    Rasmussen, C
    [J]. COMPUTER VISION - ACCV 2006, PT I, 2006, 3851 : 206 - 215
  • [8] PCA-based Arabic Character feature extraction
    Zidouri, Abdelmalek
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 652 - 655
  • [9] Adaptive PCA-based feature drift detection using statistical measure
    Agrahari, Supriya
    Singh, Anil Kumar
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2022, 25 (06): : 4481 - 4494
  • [10] Adaptive PCA-based feature drift detection using statistical measure
    Supriya Agrahari
    Anil Kumar Singh
    [J]. Cluster Computing, 2022, 25 : 4481 - 4494