ADDITIVE NOISE COMPENSATION IN THE I-VECTOR SPACE FOR SPEAKER RECOGNITION

Cited by: 0

Authors:

Ben Kheder, Waad [1]

Matrouf, Driss [1]

Bonastre, Jean-Francois [1]

Ajili, Moez [1]

Bousquet, Pierre-Michel [1]

Affiliation:

[1] Univ Avignon, LIA, Avignon, France

Source:

2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP) | 2015

Keywords:

speaker recognition; i-vectors; additive noise

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics]

Discipline classification codes:

070206; 082403

Abstract:

The performance of state-of-the-art speaker recognition systems degrades considerably in noisy environments, even though these systems achieve very good results in clean conditions. To address this limitation, we aim in this work to remove the noisy part of an i-vector directly in the i-vector space. Our approach has the advantage of operating only at the i-vector extraction level, leaving the other steps of the system unchanged. A maximum a posteriori (MAP) procedure is applied to obtain a clean version of the noisy i-vectors, taking advantage of prior knowledge about the distribution of clean i-vectors. To perform this MAP estimation, Gaussian assumptions are made on the distributions of clean and noise i-vectors. Operating on NIST 2008 data, we show a relative improvement of up to 60% compared with the baseline system. Our approach also outperforms the "multi-style" backend training technique. The effectiveness of the proposed method comes at the price of a relatively high computational cost. We conclude with some ideas for improving this aspect.
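
The MAP step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes an additive model in the i-vector space (noisy y = clean x + noise n), independent Gaussian priors on x and n, and diagonal covariances so that it runs with the standard library only. The function name `map_clean_ivector` and all parameter names are hypothetical, and the abstract does not say how the Gaussian parameters are estimated.

```python
# Sketch only: MAP clean-up of a noisy i-vector.
# ASSUMPTIONS (not stated in the abstract): additive model y = x + n,
# independent Gaussian x and n, diagonal covariances.

def map_clean_ivector(y, mu_x, var_x, mu_n, var_n):
    """Return the MAP estimate of the clean i-vector x given the noisy y.

    y            : noisy i-vector (list of floats)
    mu_x, var_x  : per-dimension mean and variance of clean i-vectors
    mu_n, var_n  : per-dimension mean and variance of the noise term
    """
    x_hat = []
    for y_i, mx, vx, mn, vn in zip(y, mu_x, var_x, mu_n, var_n):
        # Posterior precision is the sum of the prior and noise precisions.
        precision = 1.0 / vx + 1.0 / vn
        # Precision-weighted mix of the clean prior mean and the
        # noise-corrected observation (y_i - mn).
        x_hat.append((mx / vx + (y_i - mn) / vn) / precision)
    return x_hat


# Hypothetical 3-dimensional example; real i-vectors have hundreds of dimensions.
cleaned = map_clean_ivector(
    y=[1.0, -2.0, 0.5],
    mu_x=[0.0, 0.0, 0.0], var_x=[1.0, 1.0, 1.0],
    mu_n=[0.2, -0.1, 0.0], var_n=[0.5, 0.5, 0.5],
)
```

With full covariance matrices, the same estimate would be obtained by replacing the per-dimension divisions with matrix inverses, which is one plausible source of the computational cost the abstract mentions.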

Pages: 4190-4194

Page count: 5

