Speech recognition using probabilistic and statistical models

被引:0
|
作者
Singh, Amber [1 ]
Anand, R. S. [1 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, Roorkee, Uttar Pradesh, India
关键词
Automatic speech recognition (ASR); Mel frequency cepstral coefficients (MFCCs); EM algorithm; Hidden markov model; Gaussian mixture model; Vector quantization; Gaussian mixture model-Universal background model;
D O I
10.1109/CICN.2015.141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an implementation of probabilistic and statistical models for speech recognition. Three models namely Gaussian mixture model, hidden markov model and Gaussian mixture model - universal background model are discussed. In GMM, both speech identification of unknown isolated words and classification of unknown test patterns are discussed. In HMM, speech identification of isolated words are discussed. In GMM-UBM, speech identification of isolated words and speech classification of unknown test patterns are discussed. Isolated word recognizer build using all the three models for the recognition of isolated words can give 100% accuracy depending upon the initialization of the models. GMM-UBM is not found suitable for the classification of unknown test patterns.
引用
收藏
页码:686 / 690
页数:5
相关论文
共 50 条
  • [21] On Recognition of Non-Native Speech Using Probabilistic Lexical Model
    Razavi, Marzieh
    Doss, Mathew Magimai
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 26 - 30
  • [22] Using statistical compatibility to derive advanced probabilistic fatigue models
    Fernandez-Canteli, Alfonso
    Castillo, Enrique
    Lopez-Aenlle, Manuel
    Seitl, Stanislav
    [J]. FATIGUE 2010, 2010, 2 (01): : 1131 - 1140
  • [23] Improving Probabilistic Record Linkage Using Statistical Prediction Models
    Moretti, Angelo
    Shlomo, Natalie
    [J]. INTERNATIONAL STATISTICAL REVIEW, 2023, 91 (03) : 368 - 394
  • [24] Continuous speech recognition using linear dynamic models
    Ma, Tao
    Srinivasan, Sundararajan
    Lazarou, Georgios
    Picone, Joseph
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2014, 17 (01) : 11 - 16
  • [25] Automatic speech recognition using hidden Markov models
    Botros, N.M.
    Teh, C.K.
    [J]. Microcomputer Applications, 1994, 13 (01): : 6 - 12
  • [26] Speech emotion recognition using hidden Markov models
    Nwe, TL
    Foo, SW
    De Silva, LC
    [J]. SPEECH COMMUNICATION, 2003, 41 (04) : 603 - 623
  • [27] ROBUST SPEECH RECOGNITION USING MULTIVARIATE COPULA MODELS
    Bayestehtashk, Alireza
    Shafran, Izhak
    Babaeian, Amir
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5890 - 5894
  • [28] Speech recognition based on statistical models including multiple phonetic decision trees
    Shiota, Sayaka
    Hashimoto, Kei
    Zen, Heiga
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    [J]. Acoustical Science and Technology, 2011, 32 (06): : 236 - 243
  • [29] Speech recognition based on statistical models including multiple phonetic decision trees
    Shiota, Sayaka
    Hashimoto, Kei
    Zen, Heiga
    Nankaku, Yoshihiko
    Lee, Akinobu
    Tokuda, Keiichi
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (06) : 236 - 243
  • [30] Partially Occluded Object Recognition Using Statistical Models
    Zhengrong Ying
    David Castañon
    [J]. International Journal of Computer Vision, 2002, 49 : 57 - 78