A Hybrid HMM/DNN Approach to Keyword Spotting of Short Words

Cited by: 0
Authors
Chen, I-Fan [1]
Lee, Chin-Hui [1]
Affiliations
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Keywords
keyword and filler modeling; keyword detection; utterance verification; deep neural networks; knowledge-based; RECOGNITION; FEATURES;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
An HMM/DNN framework is proposed to address the problem of short-word detection. The first-stage keyword hypothesizer is redesigned with a context-aware keyword model and a 9-state filler model, which reduces the miss rate from 80% to 6% and increases the figure-of-merit (FOM) from 6.08% to 21.88% for short words. The hypothesizer is followed by an MLP-based second-stage keyword verifier that further reduces its putative hits. To enhance short-word detection, three new techniques are incorporated into the redesigned verifier: an HMM-based feature transformation for the MLPs, knowledge-based features, and deep neural networks. With a set of nine short keywords from the TIMIT set, the best FOM achieved by the proposed KWS system was 42.79%, which is comparable to the 42.6% for long content words and much better than the FOM of 18.4% for short keywords reported in previous research [10].
Pages: 1573-1577
Page count: 5
Related papers
50 in total
  • [1] Hybrid HMM/DNN System for Arabic Handwriting Keyword Spotting
    Rouhou, Ahmed Cheikh
    Kessentini, Yousri
    Kanoun, Slim
    IMAGE ANALYSIS AND RECOGNITION, ICIAR 2019, PT I, 2019, 11662 : 216 - 227
  • [2] An approach of keyword spotting based on HMM
    Yan, BF
    Guo, R
    Zhu, XY
    Zhang, B
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 2757 - 2759
  • [3] HYBRID CONTEXT DEPENDENT CD-DNN-HMM KEYWORD SPOTTING (KWS) IN SPEECH CONVERSATIONS
    Tyagi, Vivek
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [4] Exploiting Phoneme Similarities in Hybrid HMM-ANN Keyword Spotting
    Pinto, Joel
    Lovitt, Andrew
    Hermansky, Hynek
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2388 - 2391
  • [5] OPTIMIZE WHAT MATTERS: TRAINING DNN-HMM KEYWORD SPOTTING MODEL USING END METRIC
    Shrivastava, Ashish
    Kundu, Arnav
    Dhir, Chandra
    Naik, Devang
    Tuzel, Oncel
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4000 - 4004
  • [6] HYBRID CONTEXT DEPENDENT CD-DNN-HMM KEYWORDS SPOTTING ON CONTINUOUS SPEECH
    Dridi, Hinda
    Ouni, Kais
    2017 3RD INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2017, : 429 - 435
  • [7] HMM based fast keyword spotting algorithm with no garbage models
    Sunil, S
    Palit, S
    Sreenivas, TV
    ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 1020 - 1023
  • [8] Dynamic handwritten keyword spotting based on the NSHP-HMM
    Choisy, Christophe
    ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 242 - 246
  • [9] A New Keyword Spotting Approach
    Bahi, Halima
    Benati, Nadia
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 77 - +
  • [10] A Hybrid Deep Learning Approach to Keyword Spotting in Vietnamese Stele Images
    Scius-Bertrand A.
    Bui M.
    Fischer A.
    Informatica (Slovenia), 2023, 47 (03): : 361 - 372