Ensemble based speaker recognition using unsupervised data selection

被引：0

作者：

Huang, Chien-Lin ^{[1
]}

Wang, Jia-Ching ^{[1
]}

Ma, Bin ^{[2
]}

机构：

[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taipei 32001, Taiwan

[2] Human Language Technol, Inst Infocomm Res I2R, Singapore 138632, Singapore

来源：

APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING | 2016年 / 5卷

关键词：

Speaker recognition; Ensemble classifier; Unsupervised data selection;

D O I：

10.1017/ATSIP.2016.10

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This paper presents an ensemble-based speaker recognition using unsupervised data selection. Ensemble learning is a type of machine learning that applies a combination of several weak learners to achieve an improved performance than a single learner. A speech utterance is divided into several subsets based on its acoustic characteristics using unsupervised data selection methods. The ensemble classifiers are then trained with these non-overlapping subsets of speech data to improve the recognition accuracy. This new approach has two advantages. First, without any auxiliary information, we use ensemble classifiers based on unsupervised data selection to make use of different acoustic characteristics of speech data. Second, in ensemble classifiers, we apply the divide-and-conquer strategy to avoid a local optimization in the training of a single classifier. Our experiments on the 2010 and 2008 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers yields a significant performance gain.

引用

页数：9

共 50 条

[31] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
Khassanov, Yerbolat
Chong, Tze Yuang
Bigot, Benjamin
Chng, Eng Siong
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
[32] UNSUPERVISED DATA SELECTION FOR SPEECH RECOGNITION WITH CONTRASTIVE LOSS RATIOS
Park, Chanho
Ahmad, Rehan
Hain, Thomas
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8587 - 8591
[33] Ensemble feature selection using distance-based supervised and unsupervised methods in binary classification
Hallajian, Bita
Motameni, Homayun
Akbari, Ebrahim
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
[34] Activity Recognition based on Wearable Sensors Using Selection/Fusion Hybrid Ensemble
Min, Jun-Ki
Cho, Sung-Bae
2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 1319 - 1324
[35] Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition
Wang, Shuai
Yang, Yexin
Wu, Zhanghao
Qian, Yanmin
Yu, Kai
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2598 - 2609
[36] Unsupervised speaker recognition based on competition between self-organizing maps
Lapidot, I
Guterman, H
Cohen, A
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 877 - 887
[37] Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction
Lei, Howard
Lopez-Gonzalo, Eduardo
INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 892 - 895
[38] Unsupervised speaker adaptation using reference speaker weighting
Lai, Tsz-Chung
Mak, Brian
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 380 - +
[39] Ensemble Unsupervised Feature Selection Based on Permutation and R-value
Wang, Xiaomei
Huang, Xin
Lin, Xiaohui
Yang, Yuansheng
2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 795 - 800
[40] Emotion Recognition Based on Dynamic Ensemble Feature Selection
Yang, Yong
Wang, Guoyin
Kong, Hao
MAN-MACHINE INTERACTIONS, 2009, 59 : 217 - 225

← 1 2 3 4 5 →