Co-whitening of i-vectors for short and long duration speaker verification

被引：0

作者：

Xu, Longting ^{[1
]}

Lee, Kong Aik ^{[2
]}

Li, Haizhou ^{[1
]}

Yang, Zhen ^{[3
]}

机构：

[1] Natl Univ Singapore, Dept Elect & Comp Engn, Singapore, Singapore

[2] NEC Corp Ltd, Data Sci Res Labs, Tokyo, Japan

[3] Nanjing Univ Posts & Telecommun, Broadband Wireless Commun & Sensor Network Techno, Nanjing, Jiangsu, Peoples R China

来源：

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年

关键词：

Speaker recognition; co-whitening; short duration; i-vector; text-independent; canonical correlation analysis;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An i-vector is a fixed-length and low-rank representation of a speech utterance. It has been used extensively in text independent speaker verification. Ideally, speech utterances from the same speaker would map to an unique i-vector. However, this is not the case due to some intrinsic and extrinsic factors like physical condition of the speaker, channel difference, noise and notably the duration of speech utterances. In particular, we found that i-vectors extracted from short utterances exhibit larger variance than that of long utterances. To address the problem, we propose a co-whitening approach, taking into account the duration, while maximizing the correlation between the i-vectors of short and long duration. The proposed co-whitening method was derived based on canonical correlation analysis (CCA). Experimental results on NIST SRE 2010 show that co-whitening method is effective in compensating the duration mismatch, leading to a reduction of up to 13.07% in equal error rate (EER).

引用

页码：1066 / 1070

页数：5

共 50 条

[1] Duration compensation of i-vectors for short duration speaker verification
Ma, Jianbo
Sethu, Vidhyasaharan
Ambikairajah, Eliathamby
Lee, Kong Aik
ELECTRONICS LETTERS, 2017, 53 (06) : 405 - 407
[2] Senone I-Vectors for Robust Speaker Verification
Tan, Zhili
Zhu, Yingke
Mak, Man-Wai
Mak, Brian Kan-Wing
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[3] Emotional Speaker Verification Based on I-vectors
Mackova, Lenka
Cizmar, Anton
2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 533 - 536
[4] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
Larcher, Anthony
Bousquet, Pierre-Michel
Lee, Kong Aik
Matrouf, Driss
Li, Haizhou
Bonastre, Jean-Francois
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776
[5] Novel Quality Metric for Duration Variability Compensation in Speaker Verification using i-Vectors
Poddar, Arnab
Sahidullah, Md
Saha, Goutam
2017 NINTH INTERNATIONAL CONFERENCE ON ADVANCES IN PATTERN RECOGNITION (ICAPR), 2017, : 298 - 303
[6] I-VECTORS IN THE CONTEXT OF PHONETICALLY-CONSTRAINED SHORT UTTERANCES FOR SPEAKER VERIFICATION
Larcher, Anthony
Bousquet, Pierre-Michel
Lee, Kong Aik
Matrouf, Driss
Li, Haizhou
Bonastre, Jean-Francois
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4773 - 4776
[7] Denoised Senone I-Vectors for Robust Speaker Verification
Tan, Zhili
Mak, Man-Wai
Mak, Brian Kan-Wing
Zhu, Yingke
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (04) : 820 - 830
[8] CNN-based joint mapping of short and long utterance i-vectors for speaker verification using short utterances
Guo, Jinxi
Nookala, Usha Amrutha
Alwan, Abeer
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3712 - 3716
[9] An Investigation of Non-linear i-vectors for speaker verification
Chen, Nanxin
Villalba, Jesus
Dehak, Najim
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 87 - 91
[10] Robust Speaker Verification Using GFCC Based i-Vectors
Jeevan, Medikonda
Dhingra, Atul
Hanmandlu, M.
Panigrahi, B. K.
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL, NETWORKS, COMPUTING, AND SYSTEMS (ICSNCS 2016), VOL 1, 2017, 395 : 85 - 91

← 1 2 3 4 5 →