A GMM SUPERVECTOR KERNEL WITH THE BHATTACHARYYA DISTANCE FOR SVM BASED SPEAKER RECOGNITION

被引：0

作者：

You, Chang Huai ^{[1
]}

Lee, Kong Aik ^{[1
]}

Li, Haizhou ^{[1
]}

机构：

[1] ASTAR, Inst Infocomm Res I2R, Singapore, Singapore

来源：

2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年

关键词：

Gaussian Mixture Model; Support Vector Machine; Supervector; Speaker Verification; NIST Evaluation; VERIFICATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally believed that speaker cues are also partly conveyed by second order statistics. In this paper, we extend the Bhattacharyya-based SVM kernel by constructing the supervector with the mean statistical vector and the covariance statistical vector. Comparing with the Kullback-Leibler (KL) kernel, we demonstrate the effectiveness of the new kernel on the 2006 National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) dataset.

引用

页码：4221 / 4224

页数：4

共 50 条

[31] Performances Evaluation of GMM-UBM and GMM-SVM for Speaker Recognition in Realistic World
Asbai, Nassim
Amrouche, Abderrahmane
Debyeche, Mohamed
NEURAL INFORMATION PROCESSING, PT II, 2011, 7063 : 284 - 291
[32] Speaker Recognition and Speech Emotion Recognition Based on GMM
Xu, Shupeng
Liu, Yan
Liu, Xiping
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS, 2013, : 434 - 436
[33] Improvement in Supervector Linear Kernel SVM for Speaker Identification Using Feature Enhancement and Training Length Adjustment
So, Byung-Min
Kim, Kyung Wha
Kim, Min-Seok
Yang, Ii-Ho
Kim, Myung-Jae
Yu, Ha-Jin
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (06): : 330 - 336
[34] Speaker verification normalization sequence kernel based on Gaussian mixture model super-vector and Bhattacharyya distance
Xing, YuJuan
Tan, Ping
Wang, Xin
JOURNAL OF LOW FREQUENCY NOISE VIBRATION AND ACTIVE CONTROL, 2021, 40 (01) : 60 - 71
[35] Speaker recognition based on the combination of GMM and SVDD
Zhou, Yuhuan
Zhang, Xiongwei
Wang, Jinming
Gong, Yong
Zhou, Yi
PRZEGLAD ELEKTROTECHNICZNY, 2011, 87 (03): : 329 - 332
[36] COMPARISON BETWEEN GMM-SVM SEQUENCE KERNEL AND GMM: APPLICATION TO SPEECH EMOTION RECOGNITION
Trabelsi, I.
Ben Ayed, D.
Ellouze, N.
JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2016, 11 (09): : 1221 - 1233
[37] Secondary classification for GMM based speaker recognition
Pelecanos, Jason
Povey, Dan
Ramaswamy, Ganesh
2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 109 - 112
[38] Feature mapping based on GMM supervector
Guo, Wu
Dai, Lirong
2008 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING, VOLS 1 AND 2, PROCEEDINGS, 2008, : 1081 - 1085
[39] Speaker Recognition Based on GMM with an Embedded TDNN
Chen, Cunbao
Zhao, Li
NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2009, 5864 : 746 - 753
[40] A hybrid system based on GMM-SVM for Speaker Identification
Chakroun, Rania
Zouari, Leila Beltaifa
Frikha, Mondher
Ben Hamida, Ahmed
2015 15TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS (ISDA), 2015, : 654 - 658

← 1 2 3 4 5 →