Minimum Phoneme Error based filter bank analysis for speech recognition

被引:3
|
作者
Huang, Hao [1 ]
Zhu, Jie [1 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200040, Peoples R China
关键词
D O I
10.1109/ICME.2006.262722
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper the optimal filter-bank design method based on the Minimum Phone Error (MPE) criteria is investigated. We use Gaussian type filter bank for optimization and various parameters of the filters such as gain, bandwidth and center frequency are trained aiming at maximize the MPE objective function to reduce word error. Preliminary experimental results on a large vocabulary continuous Mandarin speech recognition task given in this paper showed that, compared with both the untrained Gaussian type filters and traditional triangle shaped filter bank, cepstral coefficients derived from the optimized filter bank parameters result in a superior performance for word accuracy. The filters consistent with the MPE criteria are also illustrated.
引用
收藏
页码:1081 / +
页数:2
相关论文
共 50 条
  • [1] Minimum phoneme error based heteroscedastic linear discriminant analysis for speech recognition
    Zhang, B
    Matsoukas, S
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 925 - 928
  • [2] Confusion analysis in phoneme based speech recognition in Hindi
    Bhatt, Shobha
    Dev, Amita
    Jain, Anurag
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (10) : 4213 - 4238
  • [3] Confusion analysis in phoneme based speech recognition in Hindi
    Bhatt, Shobha
    Dev, Amita
    Jain, Anurag
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 (10): : 4213 - 4238
  • [4] Confusion analysis in phoneme based speech recognition in Hindi
    Shobha Bhatt
    Amita Dev
    Anurag Jain
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 4213 - 4238
  • [5] Error Minimization in Phoneme based Automated Speech Recognition for Similar Sounding Phonemes
    Gangaputra, Karan
    [J]. INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 402 - 409
  • [6] REVERBERANT SPEECH RECOGNITION: A PHONEME ANALYSIS
    Parada, Pablo Peso
    Sharma, Dushyant
    Naylor, Patrick A.
    van Waterschoot, Toon
    [J]. 2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 567 - 571
  • [7] Clustering Approach in Speech Phoneme Recognition Based on Statistical Analysis
    Tak, Gaurav Kumar
    Bhargava, Vaibhav
    [J]. RECENT TRENDS IN NETWORK SECURITY AND APPLICATIONS, 2010, 89 : 483 - 489
  • [8] PROTOTYPE-BASED MINIMUM ERROR TRAINING FOR SPEECH RECOGNITION
    MCDERMOTT, E
    KATAGIRI, S
    [J]. APPLIED INTELLIGENCE, 1994, 4 (03) : 245 - 256
  • [9] Filter bank subtraction for robust speech recognition
    Onoe, K
    Segi, H
    Kobayakawa, T
    Sato, S
    Homma, S
    Imai, T
    Ando, A
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (03) : 483 - 488
  • [10] PCA-based human auditory filter bank for speech recognition
    Nhat, VDM
    Lee, SY
    [J]. 2004 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING & COMMUNICATIONS (SPCOM), 2004, : 393 - 397