Exploring Acoustic Factor Analysis for Limited Test Data Speaker Verification

Cited by: 0
Authors
Mamodiya, Salil [1]
Kumar, Lay [1]
Das, Rohan Kumar [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, Assam, India
Keywords
speaker verification; limited data; AFA; MFCC
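DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification code
0808; 0809
Abstract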
In the text-independent speaker verification (TI-SV) domain, the recently proposed Acoustic Factor Analysis (AFA) model has shown its advantage over the conventional i-vector based approach. AFA takes into account the redundancies present in mel frequency cepstral coefficient (MFCC) features and transforms the features to a lower-dimensional space that is much closer to the speaker subspace. In practical applications, the duration of the test data is very important for the SV task, and test data of sufficient duration may not always be available. Limited data contain less phonetic content, which makes TI-SV under limited data a challenging task. Earlier work using the i-vector based approach on MFCC features has shown that SV performance drops as the duration of the test data is reduced. This work attempts to improve SV performance for limited-duration test utterances using the AFA model. An SV system is built based on AFA, and a parallel SV system using the conventional i-vector based approach with MFCC features is also created. SV is then carried out for limited test data conditions (<= 10 s) on the NIST SRE 2003 dataset using both the AFA and the i-vector based approaches, with AFA showing improved results over the latter. The systems are then fused at the score level, and their combination is found to give a significant improvement over the baseline performance, highlighting the importance of the AFA-based modeling approach for the limited test data condition.
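The abstract does not say how the score-level fusion is performed. As an illustration only, one common choice is a weighted sum of per-trial scores after normalizing each system's scores. The sketch below assumes that scheme; the function names, the z-normalization step, the `weight` parameter, and the toy scores are all hypothetical and are not taken from the paper.

```python
import numpy as np


def znorm(scores):
    """Scale a set of trial scores to zero mean and unit variance."""
    scores = np.asarray(scores, dtype=float)
    centered = scores - scores.mean()
    std = scores.std()
    # Guard against a constant score list, where the variance is zero.
    return centered / std if std > 0 else centered


def fuse_scores(afa_scores, ivector_scores, weight=0.5):
    """Weighted-sum fusion of two systems' scores for the same trials.

    `weight` is the share given to the AFA system (an assumed parameter;
    in practice it would be tuned on held-out development trials).
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    a = znorm(afa_scores)
    b = znorm(ivector_scores)
    if a.shape != b.shape:
        raise ValueError("both systems must score the same trials")
    return weight * a + (1.0 - weight) * b


# Made-up scores for three trials, purely for illustration.
afa = [1.0, 2.0, 3.0]
ivec = [10.0, 20.0, 30.0]
fused = fuse_scores(afa, ivec, weight=0.5)
```

Normalizing before summing keeps one system from dominating simply because its raw scores span a larger range; other fusion rules, such as logistic regression on the two scores, would fit the same description.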
Pages: 1397-1401
Page count: 5
Related papers
50 in total
  • [11] Intra-speaker variability compensation in speaker verification with limited enrolling data
    Garreton, Claudio
    Becerra Yoma, Nestor
    Molina, Carlos
    Huenupan, Fernando
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 509 - 512
  • [12] Simplified factor analysis in speaker verification
    Guo, Wu
    Li, Yijie
    Dai, Lirong
    2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1316 - 1319
  • [13] Acoustic Factor Analysis based Universal Background Model for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3126 - 3130
  • [14] Different Aspects of Source Information for Limited Data Speaker Verification
    Das, Rohan Kumar
    Pati, Debadatta
    Prasanna, S. R. Mahadeva
    2015 TWENTY FIRST NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2015,
  • [15] Improvements in factor analysis based speaker verification
    Kenny, Patrick
    Boulianne, Gilles
    Ouellet, Pierre
    Dumouchel, Pierre
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 113 - 116
  • [16] Speaker verification based on factor analysis and SVM
    Guo, Wu
    Dai, Li-Rong
    Wang, Ren-Hua
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2009, 31 (02): : 302 - 305
  • [17] A Fast Implementation of Factor Analysis for Speaker Verification
    Liu, Qingsong
    Huang, Wei
    Xu, Dongxing
    Cai, Hongbin
    Dai, Beiqian
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1077 - +
  • [18] A comparison of various adaptation methods for speaker verification with limited enrollment data
    Mak, Man-Wai
    Hsiao, Roger
    Mak, Brian
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 929 - 932
  • [19] System Source and Dynamic Features for Speaker Verification for Limited Data Condition
    Kumari, T. R. Jayanthi
    Jayanna, H. S.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 1458 - 1461
  • [20] Improving the PLDA based Speaker Verification in Limited Microphone Data Conditions
    Kanagasundaram, A.
    Dean, D.
    Gonzalez-Dominguez, J.
    Sridharan, S.
    Ramos, D.
    Gonzalez-Rodriguez, J.
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3641 - 3645