Speech recognition using probabilistic and statistical models

被引:0
|
作者
Singh, Amber [1 ]
Anand, R. S. [1 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, Roorkee, Uttar Pradesh, India
关键词
Automatic speech recognition (ASR); Mel frequency cepstral coefficients (MFCCs); EM algorithm; Hidden markov model; Gaussian mixture model; Vector quantization; Gaussian mixture model-Universal background model;
D O I
10.1109/CICN.2015.141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an implementation of probabilistic and statistical models for speech recognition. Three models namely Gaussian mixture model, hidden markov model and Gaussian mixture model - universal background model are discussed. In GMM, both speech identification of unknown isolated words and classification of unknown test patterns are discussed. In HMM, speech identification of isolated words are discussed. In GMM-UBM, speech identification of isolated words and speech classification of unknown test patterns are discussed. Isolated word recognizer build using all the three models for the recognition of isolated words can give 100% accuracy depending upon the initialization of the models. GMM-UBM is not found suitable for the classification of unknown test patterns.
引用
收藏
页码:686 / 690
页数:5
相关论文
共 50 条
  • [31] Graphical object recognition using statistical language models
    Keyes, L
    O'Sullivan, A
    Winstanley, A
    [J]. EIGHTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS 1 AND 2, PROCEEDINGS, 2005, : 1095 - 1099
  • [32] Partially occluded object recognition using statistical models
    Ying, ZR
    Castañon, D
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2002, 49 (01) : 57 - 78
  • [33] APPLICATION OF STATISTICAL TECHNIQUES TO SPEECH RECOGNITION
    BERGERVACHON, C
    [J]. AUTOMATISME, 1972, 17 (03): : 76 - +
  • [34] One approach to statistical speech recognition
    Blagojevic, M
    Durovic, Z
    Kovacevic, B
    [J]. Eurocon 2005: The International Conference on Computer as a Tool, Vol 1 and 2 , Proceedings, 2005, : 1401 - 1404
  • [35] STATISTICAL MODELING FOR AUTOMATIC SPEECH RECOGNITION
    MERCER, RL
    [J]. AFIPS CONFERENCE PROCEEDINGS, 1983, 52 : 643 - &
  • [36] A STATISTICAL APPROACH TO THE AUTOMATIC RECOGNITION OF SPEECH
    SMITH, JEK
    KLEM, L
    [J]. AMERICAN PSYCHOLOGIST, 1961, 16 (07) : 445 - 445
  • [37] Articulatory feature based continuous speech recognition using probabilistic lexical modeling
    Rasipuram, Ramya
    Magimai-Doss, Mathew
    [J]. COMPUTER SPEECH AND LANGUAGE, 2016, 36 : 233 - 259
  • [38] INTEGRATED PRONUNCIATION LEARNING FOR AUTOMATIC SPEECH RECOGNITION USING PROBABILISTIC LEXICAL MODELING
    Rasipuram, Ramya
    Razavi, Marzieh
    Magimai-Doss, Mathew
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5176 - 5180
  • [40] Speech Emotion Recognition Using Canonical Correlation Analysis and Probabilistic Neural Network
    Cen, Ling
    Ser, Wee
    Yu, Zhu Liang
    [J]. SEVENTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2008, : 859 - +