Speaker-independent Thai polysyllabic word recognition using hidden Markov model

被引:0
|
作者
Ahkuputra, V
Jitapunkul, S
Pornsukchandra, W
Luksaneeyanawin, S
机构
关键词
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This correspondence presents a speech recognition system of speaker-independent Thai polysyllabic words. This development is based on the Discrete Hidden Markov Model in conjunction with vector quantization algorithm, also endpoint detection algorithm for syllable endpoint detection and separation, and time normalization algorithm. The 70-Thai word vocabulary are subdivided into four sets comprising single, double, and triple syllabled words, 20 words in each set, and the last set consists of 10-Thai numeric words, zero to nine. The seperated speech training set and testing set are composed of both male and female speakers within the range of 18 to 25 years old. Upon the tonal characteristics of Thai language, the algorithms and the model parameters are modified in order to be applicable to the Thai language. The experiments on the effects of model parameter variations on recognition rate are conducted. The model parameters are number of codebooks, number of model states, and number of training speakers. The results show that the increase in the number of codebook and the number of model states have the major effect on the recognition rates. Also, the number of training speakers has less effect than the others. The average recognition rate of this speaker-independent recognition system is 89.906 percent for 40 speakers testing set using 256-vector codebook of 10-order linear prediction coefficient and 15-state model parameters. The recognition rate of the four sets of words are 86.750 percent for single-syllabled words, 92.375 percent for double-syllabled words, 96.250 percent for triple-syllabled words, and 84.250 percent for the numeric words.
引用
收藏
页码:593 / 599
页数:7
相关论文
共 50 条
  • [1] Speaker-independent Mandarin polysyllabic word recognition
    Chang, HY
    Chen, B
    Chou, CS
    Liu, CM
    [J]. ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 329 - 332
  • [2] SPEAKER-INDEPENDENT ISOLATED WORD RECOGNITION USING MULTIPLE HIDDEN MARKOV-MODELS
    ZHANG, Y
    DESILVA, CJS
    TOGNERI, R
    ALDER, M
    ATTIKIOUZEL, Y
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1994, 141 (03): : 197 - 202
  • [3] SPEAKER-INDEPENDENT PHONE RECOGNITION USING HIDDEN MARKOV-MODELS
    LEE, KF
    HON, HW
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1989, 37 (11): : 1641 - 1648
  • [4] Speaker-independent embedded speech recognition using Hidden Markov Models
    Marufo da Silva, Mariano
    Evin, Diego A.
    Verrastro, Sebastian
    [J]. IEEE CACIDI 2016 - IEEE CONFERENCE ON COMPUTER SCIENCES, 2016,
  • [5] ON THE APPLICATION OF VECTOR QUANTIZATION AND HIDDEN MARKOV-MODELS TO SPEAKER-INDEPENDENT, ISOLATED WORD RECOGNITION
    RABINER, LR
    LEVINSON, SE
    SONDHI, MM
    [J]. BELL SYSTEM TECHNICAL JOURNAL, 1983, 62 (04): : 1075 - 1105
  • [6] On the improvements of speaker-independent isolated word recognition using chaotic model
    Barbashov, OG
    Fradkov, AL
    Maleev, OG
    Romashov, NA
    Yushmanov, DA
    [J]. CONTROL OF OSCILLATIONS AND CHAOS - 1997 1ST INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS 1-3, 1997, : 142 - 143
  • [7] DYNAMIC SPEAKER ADAPTATION IN SPEAKER-INDEPENDENT WORD RECOGNITION
    HEWETT, AJ
    HOLMES, G
    YOUNG, SJ
    [J]. PROCEEDINGS : INSTITUTE OF ACOUSTICS, VOL 8, PART 7: SPEECH & HEARING, 1986, 8 : 275 - 282
  • [8] ALGORITHM TURNS TO SPEAKER-INDEPENDENT WORD RECOGNITION
    OHR, S
    [J]. ELECTRONIC DESIGN, 1983, 31 (19) : 40 - 41
  • [9] SPEAKER-INDEPENDENT WORD RECOGNITION USING FUZZY PATTERN-MATCHING
    FUJIMOTO, J
    NAKATANI, T
    YONEYAMA, M
    [J]. FUZZY SETS AND SYSTEMS, 1989, 32 (02) : 181 - 191
  • [10] HIGH-PERFORMANCE SPEAKER-INDEPENDENT WORD RECOGNITION
    DODDINGTON, GR
    HYDRICK, BM
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 64 : S182 - S182