Exploring Acoustic Factor Analysis for Limited Test Data Speaker Verification

被引:0
|
作者
Mamodiya, Salil [1 ]
Kumar, Lay [1 ]
Das, Rohan Kumar [1 ]
Prasanna, S. R. Mahadeva [1 ]
机构
[1] Indian Inst Technol Guwahati, Dept Elect & Elect Engn, Gauhati 781039, Assam, India
关键词
speaker verification; limited data; AFA; MFCC;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
En text independent speaker verification (TI-SV) domain, recently proposed Acoustic Factor Analysis (AFA) model has shown its importance over conventional i-vector based approach. AFA takes into account the redundancies present in the mel frequency cepstral coefficient (MFCC) features. It transforms the features to a lower dimensional space which is much close to speaker subspace. In practical applications duration of the test data is very important for SV task. It may not be always the case that the test data of sufficient duration is provided. Limited data have less phonetic content, that makes TI-SV under limited data a challenging task. Previously using i-vector based approach on MFCC features, it has been proved that performance of the SV task drops as the duration of the test data is reduced. This work attempts to improve performance of SV task for limited duration test utterances using AFA model. A SV system is built based on AFA. A parallel SV system of conventional i-vector based approach using MFCC features is also created. Then SV is carried out for limited test data conditions (<= 10 s) on NIST SRE 2003 dataset using both the AFA and the i-vector based approach, AFA showing improved results over the latter case. The systems are then fused at the score level and their combination is found to give significant improvement over baseline performance highlighting importance of AFA based modeling approach for limited test data condition.
引用
收藏
页码:1397 / 1401
页数:5
相关论文
共 50 条
  • [21] An analysis of data fusion methods for speaker verification
    Farrell, KR
    Ramachandran, RP
    Mammone, RJ
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1129 - 1132
  • [22] Factor analysis based speaker verification using ASR
    Su, Hang
    Wegmann, Steven
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2223 - 2227
  • [23] Bilinear Factor Analysis for iVector Based Speaker Verification
    Lei, Yun
    Burget, Lukas
    Scheffer, Nicolas
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1586 - 1589
  • [24] Front-End Factor Analysis for Speaker Verification
    Dehak, Najim
    Kenny, Patrick J.
    Dehak, Reda
    Dumouchel, Pierre
    Ouellet, Pierre
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 788 - 798
  • [25] Speaker verification based on unsupervised normalization and factor analysis
    Department of Electronic Engineering and Information Science, University of Science and Technology of China, Hefei 230027, China
    Tien Tzu Hsueh Pao, 2009, 4 (776-779):
  • [26] New Developments in Joint Factor Analysis for Speaker Verification
    Aronowitz, Hagai
    Barkan, Oren
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 136 - 139
  • [27] VARIATIONAL BAYESIAN JOINT FACTOR ANALYSIS FOR SPEAKER VERIFICATION
    Zhao, Xianyu
    Dong, Yuan
    Zhao, Jian
    Lu, Liang
    Liu, Jiqing
    Wang, Haila
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4049 - +
  • [28] Acoustic-labial speaker verification
    Jourlin, P
    Luettin, J
    Genoud, D
    Wassner, H
    PATTERN RECOGNITION LETTERS, 1997, 18 (09) : 853 - 858
  • [29] Acoustic-labial speaker verification
    Jourlin, P
    Luettin, J
    Genoud, D
    Wassner, H
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 319 - 326
  • [30] Acoustic Feature Diversity and Speaker Verification
    Padmanabhan, R.
    Murthy, Hema A.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2110 - 2113