Comparative study of continuous hidden Markov models (CHMM) and artificial neural network (ANN) on speaker identification system

被引:0
|
作者
Kasuriya, Sawit [1 ]
Wutiwiwatchai, Chai [1 ]
Achariyakulporn, Varin [1 ]
Tanprasert, Chularat [1 ]
机构
[1] Info. R. and D. Division, Natl. Electronics Comp. Technol. C., Min. of Sci., Technol., and Environ., Sri-Ayudhaya Rd., Phayathai Bangkok, 10400, Thailand
关键词
D O I
暂无
中图分类号
学科分类号
摘要
This paper reports a comparative study between a continuous hidden Markov model (CHMM) and an artificial neural network (ANN) on a text dependent, closed set speaker identification (SID) system with Thai language recording in office and telephone environment. Thai isolated digit 0-9 and their concatenation are used as speaking text. Mel frequency cepstral coefficients (MFCC) are selected as the studied features. Two well-known recognition engines, CHMM and ANN, are conducted and compared. The ANN system (multilayer perceptron network with backpropagation learning algorithm) is applied with a special design of input feeding methods in avoiding the distortion from the normalization process. The general Gaussian density distribution HMM is developed for CHMM system. After optimizing some system's parameters by performing some preliminary experiments, CHMM gives the best identification rate at 90.4%, which is slightly better than 90.1% of ANN on digit 5 in office environment. For telephone environment, ANN gives the best identification rate at 88.84% on digit 0, which is higher than 81.1% of CHMM on digit 3. When using 3-concatenated digit, the identification rate of ANN and CHMM achieves 97.3% and 95.7% respectively for office environment, and 92.1% and 96.3% respectively for telephone environment.
引用
收藏
页码:673 / 683
相关论文
共 50 条
  • [1] Comparative study of continuous hidden Markov models (CHMM) and artificial neural network (ANN) on speaker identification system
    Kasuriya, S
    Wutiwiwatchai, C
    Acharryakulporn, V
    Tanprasert, C
    [J]. INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2001, 9 (06) : 673 - 683
  • [2] Speaker identification using hidden Markov models
    Inman, M
    Danforth, D
    Hangai, S
    Sato, K
    [J]. ICSP '98: 1998 FOURTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1998, : 609 - 612
  • [3] A STUDY ON SPEAKER ADAPTATION OF THE PARAMETERS OF CONTINUOUS DENSITY HIDDEN MARKOV-MODELS
    LEE, CH
    LIN, CH
    JUANG, BH
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1991, 39 (04) : 806 - 814
  • [4] Comparative study of discrete, semicontinuous, and continuous hidden Markov models
    Huang, X.D.
    Hon, H.W.
    Hwang, M.Y.
    Lee, K.F.
    [J]. Computer Speech and Language, 1993, 7 (04): : 359 - 368
  • [5] Speaker and gender normalization for continuous-density hidden Markov models
    Acero, A
    Huang, XD
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 342 - 345
  • [6] Speaker Identification System based on PLP Coefficients and Artificial Neural Network
    Chelali, Fatma Zohra
    Djeradi, Amar
    Djeradi, Rachida
    [J]. WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL II, 2011, : 1641 - 1646
  • [7] Microplastic predictive modelling with the integration of Artificial Neural Networks and Hidden Markov Models (ANN-HMM)
    Sajan, R. Isaac
    Manchu, M.
    Felsy, C.
    Kavitha, M. Joselin
    [J]. JOURNAL OF ENVIRONMENTAL HEALTH SCIENCE AND ENGINEERING, 2024,
  • [8] Speaker identification in the shouted environment using suprasegmental hidden Markov models
    Shahin, Ismail
    [J]. SIGNAL PROCESSING, 2008, 88 (11) : 2700 - 2708
  • [9] Arabic word dependent speaker identification system using artificial neural network
    Al-Qaisi, Aws
    [J]. International Journal of Circuits, Systems and Signal Processing, 2020, 14 : 290 - 295
  • [10] Speaker identification system using empirical mode decomposition and an artificial neural network
    Wu, Jian-Da
    Tsai, Yi-Jang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (05) : 6112 - 6117