Methods of combining multiple classifiers with different features and their applications to text-independent speaker identification

被引:70
|
作者
Chen, K
Wang, L
Chi, HS
机构
[1] BEIJING UNIV,NATL LAB MACHINE PERCEPT,BEIJING 100871,PEOPLES R CHINA
[2] BEIJING UNIV,CTR INFORMAT SCI,BEIJING 100871,PEOPLES R CHINA
关键词
combination of multiple classifiers; different features; linear opinion pools; evidential reasoning; winner-take-all; maximum likelihood learning; EM algorithm; associative switch; text-independent speaker identification;
D O I
10.1142/S0218001497000196
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In practical applications of pattern recognition, there are often different features extracted from raw data which needs recognizing. Methods of combining multiple classifiers with different features are viewed as a general problem in various application areas of pattern recognition. In this paper, a systematic investigation has been made and possible solutions are classified into three frameworks, i.e. linear opinion pools, winner-take-all and evidential reasoning. For combining multiple classifiers with different features, a novel method is presented in the framework of linear opinion pools and a modified training algorithm for associative switch is also proposed in the framework of winner-take-all. In the framework of evidential reasoning, several typical methods are briefly reviewed for use. All aforementioned methods have already been applied to text-independent speaker identification. The simulations show that results yielded by the methods described in this paper are better than not only the individual classifiers' but also ones obtained by combining multiple classifiers with the same feature. It indicates that the use of combining multiple classifiers with different features is an effective way to attack the problem of text-independent speaker identification.
引用
收藏
页码:417 / 445
页数:29
相关论文
共 50 条
  • [31] Robust features for text-independent speaker recognition with short utterances
    Rania Chakroun
    Mondher Frikha
    [J]. Neural Computing and Applications, 2020, 32 : 13863 - 13883
  • [32] An overview of text-independent speaker recognition: From features to supervectors
    Kinnunen, Tomi
    Li, Haizhou
    [J]. SPEECH COMMUNICATION, 2010, 52 (01) : 12 - 40
  • [33] Robust features for text-independent speaker recognition with short utterances
    Chakroun, Rania
    Frikha, Mondher
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (17): : 13863 - 13883
  • [34] ROBUST TEXT-INDEPENDENT SPEAKER IDENTIFICATION USING GAUSSIAN MIXTURE SPEAKER MODELS
    REYNOLDS, DA
    ROSE, RC
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): : 72 - 83
  • [35] A real-time text-independent speaker identification system
    Cordella, LP
    Foggia, P
    Sansone, C
    Vento, M
    [J]. 12TH INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND PROCESSING, PROCEEDINGS, 2003, : 632 - 637
  • [36] Text-independent speaker identification using robust statistics estimation
    El Ayadi, Moataz
    Hassan, Abdel-Karim S. O.
    Abdel-Naby, Ahmed
    Elgendy, Omar A.
    [J]. SPEECH COMMUNICATION, 2017, 92 : 52 - 63
  • [37] Wavelet entropy and neural network for text-independent speaker identification
    Daqrouq, Khaled
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (05) : 796 - 802
  • [38] Robust text-independent speaker identification over telephone channels
    Murthy, HA
    Beaufays, F
    Heck, LP
    Weintraub, M
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (05): : 554 - 568
  • [39] Text-independent speaker identification based on spectral weighting functions
    Ma, JY
    Gao, W
    [J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 267 - 272
  • [40] HCRF-UBM approach for text-independent speaker identification
    Hong, Wei-Tyng
    [J]. COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2012, 64 (05) : 1120 - 1127