Co-whitening of i-vectors for short and long duration speaker verification

被引:0
|
作者
Xu, Longting [1 ]
Lee, Kong Aik [2 ]
Li, Haizhou [1 ]
Yang, Zhen [3 ]
机构
[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore
[2] NEC Corp Ltd, Data Sci Res Labs, Tokyo, Japan
[3] Nanjing Univ Posts & Telecommun, Broadband Wireless Commun & Sensor Network Techno, Nanjing, Jiangsu, Peoples R China
关键词
Speaker recognition; co-whitening; short duration; i-vector; text-independent; canonical correlation analysis;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An i-vector is a fixed-length and low-rank representation of a speech utterance. It has been used extensively in text independent speaker verification. Ideally, speech utterances from the same speaker would map to an unique i-vector. However, this is not the case due to some intrinsic and extrinsic factors like physical condition of the speaker, channel difference, noise and notably the duration of speech utterances. In particular, we found that i-vectors extracted from short utterances exhibit larger variance than that of long utterances. To address the problem, we propose a co-whitening approach, taking into account the duration, while maximizing the correlation between the i-vectors of short and long duration. The proposed co-whitening method was derived based on canonical correlation analysis (CCA). Experimental results on NIST SRE 2010 show that co-whitening method is effective in compensating the duration mismatch, leading to a reduction of up to 13.07% in equal error rate (EER).
引用
收藏
页码:1066 / 1070
页数:5
相关论文
共 50 条
  • [1] Duration compensation of i-vectors for short duration speaker verification
    Ma, Jianbo
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    Lee, Kong Aik
    ELECTRONICS LETTERS, 2017, 53 (06) : 405 - 407
  • [2] Senone I-Vectors for Robust Speaker Verification
    Tan, Zhili
    Zhu, Yingke
    Mak, Man-Wai
    Mak, Brian Kan-Wing
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [3] Emotional Speaker Verification Based on I-vectors
    Mackova, Lenka
    Cizmar, Anton
    2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 533 - 536
  • [4] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
    Larcher, Anthony
    Bousquet, Pierre-Michel
    Lee, Kong Aik
    Matrouf, Driss
    Li, Haizhou
    Bonastre, Jean-Francois
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776
  • [5] Novel Quality Metric for Duration Variability Compensation in Speaker Verification using i-Vectors
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 298 - 303
  • [6] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
    Larcher, Anthony
    Bousquet, Pierre-Michel
    Lee, Kong Aik
    Matrouf, Driss
    Li, Haizhou
    Bonastre, Jean-Francois
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776
  • [7] Denoised Senone I-Vectors for Robust Speaker Verification
    Tan, Zhili
    Mak, Man-Wai
    Mak, Brian Kan-Wing
    Zhu, Yingke
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 820 - 830
  • [8] CNN-based joint mapping of short and long utterance i-vectors for speaker verification using short utterances
    Guo, Jinxi
    Nookala, Usha Amrutha
    Alwan, Abeer
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3712 - 3716
  • [9] An Investigation of Non-linear i-vectors for speaker verification
    Chen, Nanxin
    Villalba, Jesus
    Dehak, Najim
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 87 - 91
  • [10] Robust Speaker Verification Using GFCC Based i-Vectors
    Jeevan, Medikonda
    Dhingra, Atul
    Hanmandlu, M.
    Panigrahi, B. K.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 1, 2017, 395 : 85 - 91