Ensemble based speaker recognition using unsupervised data selection

Cited by: 0
Authors
Huang, Chien-Lin [1]
Wang, Jia-Ching [1]
Ma, Bin [2]
Affiliations
[1] Natl Cent Univ, Dept Comp Sci & Informat Engn, Taipei 32001, Taiwan
[2] Human Language Technol, Inst Infocomm Res I2R, Singapore 138632, Singapore
Keywords
Speaker recognition; Ensemble classifier; Unsupervised data selection
DOI
10.1017/ATSIP.2016.10
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
This paper presents an ensemble-based approach to speaker recognition using unsupervised data selection. Ensemble learning is a type of machine learning that combines several weak learners to achieve better performance than a single learner. A speech utterance is divided into several subsets based on its acoustic characteristics using unsupervised data selection methods. The ensemble classifiers are then trained on these non-overlapping subsets of speech data to improve recognition accuracy. This approach has two advantages. First, without any auxiliary information, the ensemble classifiers based on unsupervised data selection make use of the different acoustic characteristics of speech data. Second, the ensemble classifiers apply a divide-and-conquer strategy to avoid getting trapped in a local optimum when training a single classifier. Experiments on the 2010 and 2008 NIST Speaker Recognition Evaluation datasets show that using ensemble classifiers yields a significant performance gain.
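The pipeline described in the abstract (split frames into non-overlapping acoustic subsets, train one classifier per subset, combine the scores) can be sketched roughly as follows. The abstract does not name the selection method, the classifiers, or the fusion rule, so everything concrete here is an assumption chosen for illustration: k-means stands in for the unsupervised data selection, a diagonal Gaussian per (speaker, subset) stands in for each ensemble member, scores are fused by simple averaging, and the synthetic two-dimensional "frames" and all function names are hypothetical.

```python
import numpy as np

def assign(frames, centroids):
    """Index of the nearest centroid for every frame (the subset label)."""
    d = ((frames[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def kmeans(frames, k, iters=20):
    """Plain k-means with deterministic farthest-first initialisation."""
    centroids = [frames[0]]
    for _ in range(1, k):
        d = np.min([((frames - c) ** 2).sum(axis=1) for c in centroids], axis=0)
        centroids.append(frames[d.argmax()])
    centroids = np.array(centroids, dtype=float)
    for _ in range(iters):
        labels = assign(frames, centroids)
        for j in range(k):
            members = frames[labels == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return centroids

def fit_gaussian(frames):
    """Diagonal Gaussian (assumed stand-in for one ensemble member)."""
    return frames.mean(axis=0), frames.var(axis=0) + 1e-3

def avg_loglik(frames, model):
    mean, var = model
    ll = -0.5 * (np.log(2 * np.pi * var) + (frames - mean) ** 2 / var).sum(axis=1)
    return ll.mean()

def train_ensemble(frames_by_speaker, centroids):
    """One model per (speaker, subset); subsets never share a frame."""
    models = {}
    for spk, frames in frames_by_speaker.items():
        labels = assign(frames, centroids)
        models[spk] = {
            j: fit_gaussian(frames[labels == j])
            for j in range(len(centroids))
            if (labels == j).sum() >= 2
        }
    return models

def score(test_frames, spk_models, centroids):
    """Average the per-subset scores (one possible fusion rule)."""
    labels = assign(test_frames, centroids)
    scores = [
        avg_loglik(test_frames[labels == j], model)
        for j, model in spk_models.items()
        if (labels == j).any()
    ]
    return float(np.mean(scores)) if scores else float("-inf")

def synth(offset, n, rng):
    """Toy data: two acoustic classes along dim 0, speaker cue in dim 1."""
    a = rng.normal([0.0, offset], 0.5, size=(n, 2))
    b = rng.normal([6.0, -offset], 0.5, size=(n, 2))
    return np.vstack([a, b])

rng = np.random.default_rng(0)
train = {"A": synth(1.0, 200, rng), "B": synth(-1.0, 200, rng)}
centroids = kmeans(np.vstack(list(train.values())), k=2)
models = train_ensemble(train, centroids)

test_a = synth(1.0, 100, rng)  # unseen frames from speaker A
score_a = score(test_a, models["A"], centroids)
score_b = score(test_a, models["B"], centroids)
print("speaker A model:", score_a, "speaker B model:", score_b)
```

In this toy setup the speaker cue flips sign between the two acoustic classes, so a single model pooled over all frames would see it largely cancel out, while the per-subset models keep it. That is only meant to illustrate the motivation stated in the abstract, not to reproduce the paper's experiments.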
Pages: 9