Exploring Acoustic Factor Analysis for Limited Test Data Speaker Verification

Cited by: 0
Authors
Mamodiya, Salil [1]
Kumar, Lay [1]
Das, Rohan Kumar [1]
Prasanna, S. R. Mahadeva [1]
Affiliations
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, Assam, India
Keywords
speaker verification; limited data; AFA; MFCC
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
In the text-independent speaker verification (TI-SV) domain, the recently proposed Acoustic Factor Analysis (AFA) model has shown advantages over the conventional i-vector based approach. AFA takes into account the redundancies present in mel-frequency cepstral coefficient (MFCC) features and transforms the features to a lower-dimensional space that is much closer to the speaker subspace. In practical applications, the duration of the test data is very important for the SV task, and test data of sufficient duration may not always be available. Limited data carry less phonetic content, which makes TI-SV under limited data a challenging task. Previous work with the i-vector based approach on MFCC features has shown that SV performance drops as the duration of the test data is reduced. This work attempts to improve SV performance for limited-duration test utterances using the AFA model. An SV system is built based on AFA, along with a parallel SV system using the conventional i-vector based approach on MFCC features. SV is then carried out under limited test data conditions (<= 10 s) on the NIST SRE 2003 dataset using both systems, with AFA showing improved results over the i-vector approach. The systems are then fused at the score level, and their combination is found to give a significant improvement over the baseline performance, highlighting the importance of the AFA-based modeling approach for limited test data conditions.
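The score-level fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the scores are normalized or weighted, so the z-normalization, the equal default weight, and every name below (`z_normalize`, `fuse_scores`, `equal_error_rate`) are assumptions made for illustration, not the authors' method. The toy scores are invented and are not results from the paper.

```python
from statistics import mean, pstdev


def z_normalize(scores):
    """Shift and scale a list of scores to zero mean and unit variance."""
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:
        return [0.0 for _ in scores]
    return [(s - mu) / sigma for s in scores]


def fuse_scores(scores_a, scores_b, weight_a=0.5):
    """Weighted sum of two systems' normalized scores.

    Both lists must hold one score per trial, in the same trial order.
    weight_a is a hypothetical tuning parameter (assumed, not from the paper).
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("score lists must cover the same trials")
    norm_a = z_normalize(scores_a)
    norm_b = z_normalize(scores_b)
    return [weight_a * a + (1.0 - weight_a) * b for a, b in zip(norm_a, norm_b)]


def equal_error_rate(scores, labels):
    """Approximate EER by sweeping thresholds over the observed scores.

    labels: 1 for a target (same-speaker) trial, 0 for an impostor trial.
    Returns the mean of the false acceptance and false rejection rates
    at the threshold where the two are closest.
    """
    targets = [s for s, l in zip(scores, labels) if l == 1]
    impostors = [s for s, l in zip(scores, labels) if l == 0]
    if not targets or not impostors:
        raise ValueError("need at least one target and one impostor trial")
    best_gap, eer = None, None
    for threshold in sorted(set(scores)):
        frr = sum(s < threshold for s in targets) / len(targets)
        far = sum(s >= threshold for s in impostors) / len(impostors)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2.0
    return eer


if __name__ == "__main__":
    # Invented toy trials: first three are targets, last three impostors.
    labels = [1, 1, 1, 0, 0, 0]
    afa_scores = [2.0, 1.5, 0.2, 0.4, -1.0, -0.5]     # hypothetical AFA system
    ivector_scores = [1.0, 0.1, 0.9, -0.2, 0.3, -0.8]  # hypothetical i-vector system
    fused = fuse_scores(afa_scores, ivector_scores)
    for name, s in [("AFA", afa_scores), ("i-vector", ivector_scores), ("fused", fused)]:
        print(name, "EER:", equal_error_rate(s, labels))
```

Normalizing before summing keeps one system from dominating simply because its raw scores span a larger range. In practice the weight would be tuned on held-out development trials rather than fixed at 0.5.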
Pages: 1397-1401
Number of pages: 5
Related papers
  • [1] Exploring kernel discriminant analysis for speaker verification with limited test data
    Das, Rohan Kumar
    Manam, Akhil Babu
    Prasanna, S. R. Mahadeva
    PATTERN RECOGNITION LETTERS, 2017, 98: 26-31
  • [2] Exploring different attributes of source information for speaker verification with limited test data
    Das, Rohan Kumar
    Prasanna, S. R. Mahadeva
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (01): 184-190
  • [3] Speaker Verification using Acoustic Factor Analysis with Phonetic Content Compensation in Limited and Degraded Test Conditions
    Manam, Akhil Babu
    Revanth, Tummala Sai
    Das, Rohan Kumar
    Prasanna, S. R. Mahadeva
    PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016: 1402-1406
  • [4] Acoustic Factor Analysis for Robust Speaker Verification
    Hasan, Taufiq
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (04): 842-853
  • [5] Speaker Verification with the Constraint of Limited Data
    Kumari, Thyamagondlu Renukamurthy Jayanthi
    Jayanna, Haradagere Siddaramaiah
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2018, 14 (04): 807-823
  • [6] Limited Data Speaker Verification using MFSR Analysis Technique
    Kumari, T. R. Jayanthi
    Jayanna, H. S.
    2015 INTERNATIONAL CONFERENCE ON TRENDS IN AUTOMATION, COMMUNICATIONS AND COMPUTING TECHNOLOGY (I-TACT-15), 2015
  • [7] PLDA Speaker Verification with Limited Speech Data
    Ridzik, Andrej
    Rusko, Milan
    SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319: 325-332
  • [8] On the Use of Factor Analysis with Restricted Target Data in Speaker Verification
    Gonzalez-Dominguez, Javier
    Baker, Brendan
    Vogt, Robbie
    Gonzalez-Rodriguez, Joaquin
    Sridharan, Sridha
    ODYSSEY 2010: THE SPEAKER AND LANGUAGE RECOGNITION WORKSHOP, 2010: 103-108
  • [9] IMPROVING PLDA SPEAKER VERIFICATION WITH LIMITED DEVELOPMENT DATA
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014
  • [10] Maximum Likelihood Acoustic Factor Analysis Models for Robust Speaker Verification in Noise
    Hasan, Taufiq
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02): 381-391