Ensemble based speaker recognition using unsupervised data selection

Cited by: 0

Authors:

Huang, Chien-Lin [1]

Wang, Jia-Ching [1]

Ma, Bin [2]

Affiliations:

[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taipei 32001, Taiwan

[2] Human Language Technol, Inst Infocomm Res I2R, Singapore 138632, Singapore

Source:

APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING | 2016 / Vol. 5

Keywords:

Speaker recognition; Ensemble classifier; Unsupervised data selection

DOI:

10.1017/ATSIP.2016.10

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification codes:

0808; 0809

Abstract:

This paper presents an ensemble-based speaker recognition approach using unsupervised data selection. Ensemble learning is a type of machine learning that combines several weak learners to achieve better performance than a single learner. A speech utterance is divided into several subsets based on its acoustic characteristics using unsupervised data selection methods. The ensemble classifiers are then trained on these non-overlapping subsets of speech data to improve recognition accuracy. This approach has two advantages. First, without any auxiliary information, ensemble classifiers based on unsupervised data selection make use of the different acoustic characteristics of speech data. Second, the ensemble classifiers apply a divide-and-conquer strategy to avoid the local optimum that can arise when training a single classifier. Experiments on the 2010 and 2008 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers yields a significant performance gain.
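
The pipeline described in the abstract (split frames into non-overlapping subsets, train one classifier per subset, fuse the scores) can be illustrated with a toy sketch. This is not the authors' implementation: the abstract names neither the data selection method nor the classifiers, so the sketch assumes plain k-means as the unsupervised data selection and a per-subset mean vector as a stand-in weak classifier. All function names (`kmeans`, `partition`, `enroll`, `ensemble_score`) are hypothetical.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_vector(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def partition(frames, centroids):
    """Assign every frame to its nearest centroid: non-overlapping subsets."""
    groups = [[] for _ in centroids]
    for f in frames:
        nearest = min(range(len(centroids)),
                      key=lambda c: sq_dist(f, centroids[c]))
        groups[nearest].append(f)
    return groups

def kmeans(frames, k, iters=20, seed=0):
    """Assumed data selection step: plain k-means over acoustic frames."""
    rng = random.Random(seed)
    centroids = [list(f) for f in rng.sample(frames, k)]
    for _ in range(iters):
        groups = partition(frames, centroids)
        # Keep the old centroid when a subset ends up empty.
        centroids = [mean_vector(g) if g else c
                     for g, c in zip(groups, centroids)]
    return centroids

def enroll(frames, centroids):
    """One weak 'classifier' per subset: here just the subset mean (assumption)."""
    return [mean_vector(g) if g else None
            for g in partition(frames, centroids)]

def ensemble_score(model, test_frames, centroids):
    """Score each subset separately, then fuse by averaging."""
    scores = []
    for member, group in zip(model, partition(test_frames, centroids)):
        if member is not None and group:
            scores.append(-sq_dist(member, mean_vector(group)))
    return sum(scores) / len(scores) if scores else float("-inf")
```

A higher fused score means a closer match. Averaging is only one possible fusion rule, and a real system would replace the mean-vector members with proper speaker models.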


Pages: 9
