I-vector Extraction for Speaker Recognition Based on Dimensionality Reduction

被引:14
|
作者
Ibrahim, Noor Salwani [1 ]
Ramli, Dzati Athiar [1 ]
机构
[1] Univ Sains Malaysia, Sch Elect & Elect, Engn Campus, Nibong Tebal 14300, Pulau Pinang, Malaysia
关键词
Bob Spear toolbox; I-vectors; Dimensionality Reduction; UBM size; Frog Identification; SHORT SEQUENCES; IDENTIFICATION;
D O I
10.1016/j.procs.2018.08.126
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the domain of speaker recognition, many methods have been proposed over time. The technology for automatic speaker recognition has now reached a good level of performance but there is still need of improvement. In this paper, a new low-dimensional speaker- and channel-dependent space is defined using a simple factor analysis also known as i-vector. This space is named the total variability space because it models both speaker and channel variabilities. The i-vector subspace modelling is one of the recent methods that have become the state of the art technique in this domain. This method largely provides the benefit of modelling both the intra-domain and inter-domain variabilities into the same low dimensional space. In this study, 2656 syllables bio-acoustic signals from 55 species of frog taken from Intelligent Biometric Group, USM database are used for frog identification system. Parameters of the system are initially tuned such as Universal Background Model (UBM) size (32, 64 and 128 Gaussians) and i-vector dimensionality (100, 200 and 400 dimensions). To the end, we assess the effect of the parameter tuned and record the computation time. We observed that, the accuracy for smaller UBM size and higher i-vector dimensionality outperforms others with result of 91.11% is achieved. From this research, it can be concluded that UBM size and i-vector dimensionality effect the accuracy of frog identification based on i-vector. (C) 2018 The Authors. Published by Elsevier Ltd.
引用
收藏
页码:1534 / 1540
页数:7
相关论文
共 50 条
  • [41] Deep Learning Backend for Single and Multisession i-Vector Speaker Recognition
    Ghahabi, Omid
    Hernando, Javier
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 807 - 817
  • [42] Improved i-vector extraction technique for speaker verification with short utterances
    Poddar A.
    Sahidullah M.
    Saha G.
    International Journal of Speech Technology, 2018, 21 (03) : 473 - 488
  • [43] Effect of multicondition training on i-vector PLDA configurations for speaker recognition
    Rajan, Padmanabhan
    Kinnunen, Tomi
    Hautamaki, Ville
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3661 - 3664
  • [44] An I-Vector Backend for Speaker Verification
    Kenny, Patrick
    Stafylakis, Themos
    Alam, Jahangir
    Kockmann, Marcel
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2307 - 2311
  • [45] A NOISE ROBUST I-VECTOR EXTRACTOR USING VECTOR TAYLOR SERIES FOR SPEAKER RECOGNITION
    Lei, Yun
    Burget, Lukas
    Scheffer, Nicolas
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6788 - 6791
  • [46] Non-speaker information reduction from Cosine Similarity Scoring in i-vector based speaker verification
    Zeinali, Hossein
    Mirian, Alireza
    Sameti, Hossein
    BabaAli, Bagher
    COMPUTERS & ELECTRICAL ENGINEERING, 2015, 48 : 226 - 238
  • [47] Sparsity Analysis and Compensation for i-Vector Based Speaker Verification
    Li, Wei
    Fu, Tian Fan
    Zhu, Jie
    Chen, Ning
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 381 - 388
  • [48] Feature sparsity analysis for i-vector based speaker verification
    Li, Wei
    Fu, Tianfan
    You, Hanxu
    Zhu, Jie
    Chen, Ning
    SPEECH COMMUNICATION, 2016, 80 : 60 - 70
  • [49] Geometric Discriminant Analysis for I-vector Based Speaker Verification
    Xu, Can
    Chen, Xianhong
    He, Liang
    Liu, Jia
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1636 - 1640
  • [50] WEIGHTED LDA TECHNIQUES FOR I-VECTOR BASED SPEAKER VERIFICATION
    Kanagasundaram, A.
    Dean, D.
    Vogt, R.
    McLaren, M.
    Sridharan, S.
    Mason, M.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4781 - 4784