Cosine Metric Learning for Speaker Verification in the i-Vector Space

被引：0

作者：

Bai, Zhong ^{[1
]}

Zhang, Xiao-Lei

Chen, Jingdong

机构：

[1] Northwestern Polytech Univ, Ctr Intelligent Acoust & Immers Commun, Fremont, CA 94539 USA

来源：

19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES | 2018年

基金：

中国国家自然科学基金;

关键词：

speaker verification; cosine metric learning; channel and session compensation;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

It is known that the equal-error-rate (EER) performance of a speaker verification system is determined by the overlap region of the decision scores of true and imposter trials. Also, the cosine similarity scores of the true or imposter trials produced by the state-of-the-art i-vector front-end approximate to a Gaussian distribution, and the overlap region of the two classes of trials depends mainly on their between-class distance. Motivated by the above facts, this paper presents a cosine similarity learning (CML) framework for speaker verification, which combines classical compensation techniques and the cosine similarity scoring for improving the EER performance. CML minimizes the overlap region by enlarging the between-class distance while introducing a regularization term to control the with-in class variance, which is initialized by a traditional channel compensation technique such as linear discriminant analysis. Experiments are carried out to compare the proposed CML framework with several traditional channel compensation baselines on the NIST speaker recognition evaluation data sets. The results show that CML outperforms all the studied initialization compensation techniques.

引用

下载

页码：1126 / 1130

页数：5

共 50 条

[21] An improved i-vector extraction algorithm for speaker verification
Li, Wei
Fu, Tianfan
Zhu, Jie
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015, : 1 - 9
[22] Noise Compensation in i-vector Space Using Linear Regression for Robust Speaker Verification
Baby, Renjith
Kumar, C. Santhosh
George, Kuruvachan K.
Panda, Ashish
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 161 - 165
[23] Sparsity Analysis and Compensation for i-Vector Based Speaker Verification
Li, Wei
Fu, Tian Fan
Zhu, Jie
Chen, Ning
SPEECH AND COMPUTER (SPECOM 2015), 2015, 9319 : 381 - 388
[24] Feature sparsity analysis for i-vector based speaker verification
Li, Wei
Fu, Tianfan
You, Hanxu
Zhu, Jie
Chen, Ning
SPEECH COMMUNICATION, 2016, 80 : 60 - 70
[25] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
Poddar, Arnab
Sahidullah, Md
Saha, Goutam
PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
[26] Geometric Discriminant Analysis for I-vector Based Speaker Verification
Xu, Can
Chen, Xianhong
He, Liang
Liu, Jia
2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1636 - 1640
[27] Bayesian Principal Component Analysis for I-Vector Speaker Verification
Rong Y.-F.
Chen C.
Chen D.-Y.
He Y.-J.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2021, 49 (11): : 2186 - 2194
[28] WEIGHTED LDA TECHNIQUES FOR I-VECTOR BASED SPEAKER VERIFICATION
Kanagasundaram, A.
Dean, D.
Vogt, R.
McLaren, M.
Sridharan, S.
Mason, M.
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4781 - 4784
[29] PERFORMANCE OF I-VECTOR SPEAKER VERIFICATION AND THE DETECTION OF SYNTHETIC SPEECH
McClanahan, Richard D.
Stewart, Bryan
De Leon, Phillip L.
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[30] SPEAKER VERIFICATION USING SIMPLIFIED AND SUPERVISED I-VECTOR MODELING
Li, Ming
Tsiartas, Andreas
Van Segbroeck, Maarten
Narayanan, Shrikanth S.
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7199 - 7203

← 1 2 3 4 5 →