A study on user defined spoken wake-up word recognition system using deep neural network-hidden Markov model hybrid model

被引:0
|
作者
Yoon, Ki-mu [1 ]
Kim, Wooil [1 ]
机构
[1] Incheon Natl Univ, Dept Comp Sci & Engn, 119 Acad Ro, Incheon 22012, South Korea
来源
关键词
User-defined wake up word; Wake up word recognition; Automatic speech recognition; Deep neural network;
D O I
10.7776/ASK.2020.39.2.131
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Wake Up Word (WUW) is a short utterance used to convert speech recognizer to recognition mode. The WUW defined by the user who actually use the speech recognizer is called user-defined WUW. In this paper, to recognize user-defined WUW, we construct traditional Gaussian Mixture Model-Hidden Markov Model (GMM-HMM), Linear Discriminant Analysis (LDA)-GMM-HMM and LDA-Deep Neural Network (DNN)HMM based system and compare their performances. Also, to improve recognition accuracy of the WUW system, a threshold method is applied to each model, which significantly reduces the error rate of the WUW recognition and the rejection failure rate of non-WUW simultaneously. For LDA-DNN-HMM system, when the WUW error rate is 9.84 %, the rejection failure rate of non-WUW is 0.0058 %, which is about 4.82 times lower than the LDA-GMM-HMM system. These results demonstrate that LDA-DNN-HMM model developed in this paper proves to be highly effective for constructing user-defined WUW recognition system.
引用
收藏
页码:131 / 136
页数:6
相关论文
共 41 条
  • [1] Continuous action segmentation and recognition using hybrid convolutional neural network-hidden Markov model model
    Lei, Jun
    Li, Guohui
    Zhang, Jun
    Guo, Qiang
    Tu, Dan
    [J]. IET COMPUTER VISION, 2016, 10 (06) : 537 - 544
  • [2] GLOBAL OPTIMIZATION OF A NEURAL NETWORK-HIDDEN MARKOV MODEL HYBRID
    BENGIO, Y
    DEMORI, R
    FLAMMIA, G
    KOMPE, R
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (02): : 252 - 259
  • [3] A Neural Network Hidden Markov Model Hybrid for cursive word recognition
    Knerr, S
    Augustin, E
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1518 - 1520
  • [4] Offline handwritten word recognition using a hybrid neural network and Hidden Markov model
    Tay, YH
    Lallican, PM
    Khalid, M
    Viard-Gaudin, C
    Knerr, S
    [J]. ISSPA 2001: SIXTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1 AND 2, PROCEEDINGS, 2001, : 382 - 385
  • [5] Hybrid hidden Markov model neural network system for EMG signals recognition
    Kwon, J
    Min, H
    Hong, S
    [J]. PROCEEDINGS OF THE 18TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOL 18, PTS 1-5, 1997, 18 : 1468 - 1469
  • [6] Hybrid approaches to frontal view face recognition using the hidden Markov model and neural network
    Yoon, KS
    Ham, YK
    Park, RH
    [J]. PATTERN RECOGNITION, 1998, 31 (03) : 283 - 293
  • [7] Hybrid Hidden Markov Model and Artificial Neural Network for Automatic Speech Recognition
    Tang, Xian
    [J]. PROCEEDINGS OF THE 2009 PACIFIC-ASIA CONFERENCE ON CIRCUITS, COMMUNICATIONS AND SYSTEM, 2009, : 682 - 685
  • [8] Hybrid Deep Neural Network - Hidden Markov Model Based Network Traffic Classification
    Tan, Xincheng
    Xie, Yi
    [J]. COMMUNICATIONS AND NETWORKING, CHINACOM 2018, 2019, 262 : 604 - 614
  • [9] Hybrid Deep Neural Network - Hidden Markov Model (DNN-HMM) Based Speech Emotion Recognition
    Li, Longfei
    Zhao, Yong
    Jiang, Dongmei
    Zhang, Yanning
    Wang, Fengna
    Gonzalez, Isabel
    Valentin, Enescu
    Sahli, Hichem
    [J]. 2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 312 - 317
  • [10] A comprehensive study of hybrid neural network hidden Markov model for offline handwritten Chinese text recognition
    Zi-Rui Wang
    Jun Du
    Wen-Chao Wang
    Jian-Fang Zhai
    Jin-Shui Hu
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2018, 21 : 241 - 251