An improved i-vector extraction algorithm for speaker verification

被引:7
|
作者
Li, Wei [1 ]
Fu, Tianfan [2 ]
Zhu, Jie [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn CSE, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Speaker verification; i-vector; Total factor space; Phonetic variability; Component reduction analysis (CRA);
D O I
10.1186/s13636-015-0061-x
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Over recent years, i-vector-based framework has been proven to provide state-of-the-art performance in speaker verification. Each utterance is projected onto a total factor space and is represented by a low-dimensional feature vector. Channel compensation techniques are carried out in this low-dimensional feature space. Most of the compensation techniques take the sets of extracted i-vectors as input. By constructing between-class covariance and within-class covariance, we attempt to minimize the between-class variance mainly caused by channel effect and to maximize the variance between speakers. In the real-world application, enrollment and test data from each user (or speaker) are always scarce. Although it is widely thought that session variability is mostly caused by channel effects, phonetic variability, as a factor that causes session variability, is still a matter to be considered. We propose in this paper a new i-vector extraction algorithm from the total factor matrix which we term component reduction analysis (CRA). This new algorithm contributes to better modelling of session variability in the total factor space. We reported results on the male English trials of the core condition of the NIST 2008 Speaker Recognition Evaluation (SREs) dataset. As measured both by equal error rate and the minimum values of the NIST detection cost function, 10-15% relative improvement is achieved compared to the baseline of traditional i-vector-based system.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 50 条
  • [1] An improved i-vector extraction algorithm for speaker verification
    Wei Li
    Tianfan Fu
    Jie Zhu
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2015
  • [2] Improved i-vector extraction technique for speaker verification with short utterances
    Poddar A.
    Sahidullah M.
    Saha G.
    [J]. International Journal of Speech Technology, 2018, 21 (3) : 473 - 488
  • [3] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    [J]. PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
  • [4] A Novel Boosting Algorithm for Improved i-Vector based Speaker Verification in Noisy Environments
    Sarkar, Sourjya
    Rao, K. Sreenivasa
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 671 - 675
  • [5] An I-Vector Backend for Speaker Verification
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Kockmann, Marcel
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
  • [6] Improved Supervised Locality Preserving Projection for I-vector Based Speaker Verification
    You, Lanhua
    Guo, Wu
    Song, Yan
    Zhang, Sheng
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 62 - 66
  • [7] Improved i-vector Speaker Verification Based on WCCN and ZT-norm
    Xing, Yujuan
    Tan, Ping
    Zhang, Chengwen
    [J]. BIOMETRIC RECOGNITION, 2016, 9967 : 424 - 431
  • [8] Simplification of I-Vector Extraction for Speaker Identification
    XU Longting
    YANG Zhen
    SUN Linhui
    [J]. Chinese Journal of Electronics, 2016, 25 (06) : 1121 - 1126
  • [9] Improved i-Vector Representation for Speaker Diarization
    Xu, Yan
    McLoughlin, Ian
    Song, Yan
    Wu, Kui
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2016, 35 (09) : 3393 - 3404
  • [10] Simplification of I-Vector Extraction for Speaker Identification
    Xu Longting
    Yang Zhen
    Sun Linhui
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2016, 25 (06) : 1121 - 1126