Ensemble based speaker recognition using unsupervised data selection

被引:0
|
作者
Huang, Chien-Lin [1 ]
Wang, Jia-Ching [1 ]
Ma, Bin [2 ]
机构
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taipei 32001, Taiwan
[2] Human Language Technol, Inst Infocomm Res I2R, Singapore 138632, Singapore
关键词
Speaker recognition; Ensemble classifier; Unsupervised data selection;
D O I
10.1017/ATSIP.2016.10
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents an ensemble-based speaker recognition using unsupervised data selection. Ensemble learning is a type of machine learning that applies a combination of several weak learners to achieve an improved performance than a single learner. A speech utterance is divided into several subsets based on its acoustic characteristics using unsupervised data selection methods. The ensemble classifiers are then trained with these non-overlapping subsets of speech data to improve the recognition accuracy. This new approach has two advantages. First, without any auxiliary information, we use ensemble classifiers based on unsupervised data selection to make use of different acoustic characteristics of speech data. Second, in ensemble classifiers, we apply the divide-and-conquer strategy to avoid a local optimization in the training of a single classifier. Our experiments on the 2010 and 2008 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers yields a significant performance gain.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Unsupervised Language Model Adaptation by Data Selection for Speech Recognition
    Khassanov, Yerbolat
    Chong, Tze Yuang
    Bigot, Benjamin
    Chng, Eng Siong
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 508 - 517
  • [32] UNSUPERVISED DATA SELECTION FOR SPEECH RECOGNITION WITH CONTRASTIVE LOSS RATIOS
    Park, Chanho
    Ahmad, Rehan
    Hain, Thomas
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 8587 - 8591
  • [33] Ensemble feature selection using distance-based supervised and unsupervised methods in binary classification
    Hallajian, Bita
    Motameni, Homayun
    Akbari, Ebrahim
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [34] Activity Recognition based on Wearable Sensors Using Selection/Fusion Hybrid Ensemble
    Min, Jun-Ki
    Cho, Sung-Bae
    2011 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2011, : 1319 - 1324
  • [35] Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition
    Wang, Shuai
    Yang, Yexin
    Wu, Zhanghao
    Qian, Yanmin
    Yu, Kai
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2598 - 2609
  • [36] Unsupervised speaker recognition based on competition between self-organizing maps
    Lapidot, I
    Guterman, H
    Cohen, A
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 877 - 887
  • [37] Importance of Nasality Measures for Speaker Recognition Data Selection and Performance Prediction
    Lei, Howard
    Lopez-Gonzalo, Eduardo
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 892 - 895
  • [38] Unsupervised speaker adaptation using reference speaker weighting
    Lai, Tsz-Chung
    Mak, Brian
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 380 - +
  • [39] Ensemble Unsupervised Feature Selection Based on Permutation and R-value
    Wang, Xiaomei
    Huang, Xin
    Lin, Xiaohui
    Yang, Yuansheng
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 795 - 800
  • [40] Emotion Recognition Based on Dynamic Ensemble Feature Selection
    Yang, Yong
    Wang, Guoyin
    Kong, Hao
    MAN-MACHINE INTERACTIONS, 2009, 59 : 217 - 225