I-vector Extraction for Speaker Recognition Based on Dimensionality Reduction

Cited by: 14
Authors
Ibrahim, Noor Salwani [1]
Ramli, Dzati Athiar [1]
Affiliation
[1] Univ Sains Malaysia, Sch Elect & Elect, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
Keywords
Bob Spear toolbox; I-vectors; Dimensionality Reduction; UBM size; Frog Identification; SHORT SEQUENCES; IDENTIFICATION;
DOI
10.1016/j.procs.2018.08.126
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In the domain of speaker recognition, many methods have been proposed over time. The technology for automatic speaker recognition has now reached a good level of performance, but there is still room for improvement. In this paper, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis, also known as the i-vector approach. This space is named the total variability space because it models both speaker and channel variabilities. I-vector subspace modelling is one of the recent methods that have become the state-of-the-art technique in this domain. Its main benefit is that it models both the intra-domain and inter-domain variabilities in the same low-dimensional space. In this study, 2656 bio-acoustic syllable signals from 55 frog species, taken from the database of the Intelligent Biometric Group, USM, are used for a frog identification system. Two parameters of the system are tuned: the Universal Background Model (UBM) size (32, 64 and 128 Gaussians) and the i-vector dimensionality (100, 200 and 400 dimensions). To this end, we assess the effect of the tuned parameters and record the computation time. We observed that a smaller UBM size combined with a higher i-vector dimensionality outperforms the other configurations, achieving an accuracy of 91.11%. From this research, it can be concluded that UBM size and i-vector dimensionality affect the accuracy of frog identification based on i-vectors. (C) 2018 The Authors. Published by Elsevier Ltd.
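The tuning procedure described in the abstract amounts to a sweep over nine configurations (three UBM sizes by three i-vector dimensionalities), recording accuracy and computation time for each. The following is a minimal sketch of such a sweep, not the authors' implementation: the function names `sweep` and `best`, and the `evaluate` callback that would train and score one configuration, are hypothetical. Only the parameter values come from the abstract.

```python
import itertools
import time

# Parameter values stated in the abstract.
UBM_SIZES = (32, 64, 128)        # number of Gaussians in the UBM
IVECTOR_DIMS = (100, 200, 400)   # i-vector dimensionality


def sweep(evaluate):
    """Call evaluate(ubm_size, ivector_dim) for every configuration.

    `evaluate` is a hypothetical callback that trains and tests one
    system and returns its accuracy. The wall-clock time of each call
    is recorded alongside the accuracy.
    """
    results = []
    for ubm_size, ivector_dim in itertools.product(UBM_SIZES, IVECTOR_DIMS):
        start = time.perf_counter()
        accuracy = evaluate(ubm_size, ivector_dim)
        elapsed = time.perf_counter() - start
        results.append({
            "ubm_size": ubm_size,
            "ivector_dim": ivector_dim,
            "accuracy": accuracy,
            "seconds": elapsed,
        })
    return results


def best(results):
    """Return the configuration with the highest accuracy."""
    return max(results, key=lambda r: r["accuracy"])
```

In use, `evaluate` would wrap whatever toolkit performs UBM training, i-vector extraction and scoring (the keywords mention the Bob Spear toolbox). Passing it in as a callback keeps the sweep independent of that choice.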
Pages: 1534-1540
Number of pages: 7