Discriminative Training using Heterogeneous Feature Vector for Hindi Automatic Speech Recognition System

被引:0
|
作者
Dua, Mohit [1 ]
Aggarwal, Rajesh Kumar [1 ]
Biswas, Mantosh [1 ]
机构
[1] Natl Inst Technol, Dept Comp Engn, Kurukshetra, Haryana, India
关键词
automatic speech recognition; MFCC; PLP; minimum phone error; HMM; ACOUSTIC MODELING PROBLEM;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Training and testing are the two phases that are used in statistical approach of designing an automatic speech recognition (ASR) system. The training phase includes parameterization of input speech signal and acoustic modeling of speech features. The paper proposes discriminative training of hidden markov Model (HMM) that uses heterogeneous feature vector for continuous Hindi ASR system. A linear interpolation of mel frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) is used to generate heterogeneous feature streams. The implemented work uses maximum mutual information estimation (MMIE) and minimum phone error (MPE) discriminative techniques for acoustic model training. The results show that MF-PLP parameterization with MPE discriminative techniques combination outperforms the other feature extraction and discriminative combination.
引用
收藏
页码:158 / 162
页数:5
相关论文
共 50 条
  • [1] Discriminative Techniques for Hindi Speech Recognition System
    Aggarwal, Rajesh Kumar
    Dave, Mayank
    [J]. INFORMATION SYSTEMS FOR INDIAN LANGUAGES, 2011, 139 : 261 - 266
  • [2] Discriminative Training for Automatic Speech Recognition
    Heigold, Georg
    Ney, Hermann
    Schlueter, Ralf
    Wiesler, Simon
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 58 - 69
  • [3] Performance evaluation of sequentially combined heterogeneous feature streams for Hindi speech recognition system
    R. K. Aggarwal
    M. Dave
    [J]. Telecommunication Systems, 2013, 52 : 1457 - 1466
  • [4] Performance evaluation of sequentially combined heterogeneous feature streams for Hindi speech recognition system
    Aggarwal, R. K.
    Dave, M.
    [J]. TELECOMMUNICATION SYSTEMS, 2013, 52 (03) : 1457 - 1466
  • [5] Discriminative training of HMMs for automatic speech recognition: A survey
    Jiang, Hui
    [J]. COMPUTER SPEECH AND LANGUAGE, 2010, 24 (04): : 589 - 608
  • [6] A feature-based hierarchical speech recognition system for Hindi
    Samudravijaya, K
    Ahuja, R
    Bondale, N
    Jose, T
    Krishnan, S
    Poddar, P
    Rao, PVS
    Raveendran, R
    [J]. SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 1998, 23 (4): : 313 - 340
  • [7] A feature-based hierarchical speech recognition system for Hindi
    K Samudravijaya
    R Ahuja
    N Bondale
    T Jose
    S Krishnan
    P Poddar
    xxPVS Rao
    R Raveendran
    [J]. Sadhana, 1998, 23 : 313 - 340
  • [8] Feature-based hierarchical speech recognition system for Hindi
    Samudravijaya, K.
    Ahuja, R.
    Bondale, N.
    Jose, T.
    Krishnan, S.
    Poddar, P.
    Rao, P.V.S.
    Raveendran, R.
    [J]. Sadhana - Academy Proceedings in Engineering Sciences, 1998, 23 (pt 4): : 313 - 340
  • [9] Optimizing Integrated Features for Hindi Automatic Speech Recognition System
    Dua, Mohit
    Aggarwal, Rajesh Kumar
    Biswas, Mantosh
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) : 959 - 976
  • [10] Training of Automatic Speech Recognition System on Noised Speech
    Prodeus, Arkadiy
    Kukharicheva, Kateryna
    [J]. 2016 4TH INTERNATIONAL CONFERENCE ON METHODS AND SYSTEMS OF NAVIGATION AND MOTION CONTROL (MSNMC), 2016, : 221 - 223