Discriminative keyword spotting

Cited by: 73
Authors
Keshet, Joseph [1 ]
Grangier, David [2 ]
Bengio, Samy [3 ]
Affiliations
[1] IDIAP Res Inst, CH-1920 Martigny, Switzerland
[2] NEC Labs Amer, Princeton, NJ 08540 USA
[3] Google Inc, Mountain View, CA 94043 USA
Keywords
Keyword spotting; Spoken term detection; Speech recognition; Large margin and kernel methods; Support vector machines; Discriminative models;
DOI
10.1016/j.specom.2008.10.002
Chinese Library Classification number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
This paper proposes a new approach for keyword spotting, which is based on large margin and kernel methods rather than on HMMs. Unlike previous approaches, the proposed method employs a discriminative learning procedure, in which the learning phase aims at achieving a high area under the ROC curve, as this quantity is the most common measure to evaluate keyword spotters. The keyword spotter we devise is based on mapping the input acoustic representation of the speech utterance along with the target keyword into a vector-space. Building on techniques used for large margin and kernel methods for predicting whole sequences, our keyword spotter distills to a classifier in this vector-space, which separates speech utterances in which the keyword is uttered from speech utterances in which the keyword is not uttered. We describe a simple iterative algorithm for training the keyword spotter and discuss its formal properties, showing theoretically that it attains high area under the ROC curve. Experiments on read speech with the TIMIT corpus show that the resulting discriminative system outperforms the conventional context-independent HMM-based system. Further experiments using the TIMIT trained model, but tested on both read (HTIMIT, WSJ) and spontaneous speech (OGI Stories), show that, without further training or adaptation to the new corpus, our discriminative system outperforms the conventional context-independent HMM-based system. (C) 2008 Elsevier B.V. All rights reserved.
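The abstract's training objective, the area under the ROC curve (AUC), can be read as a pairwise ranking quantity: the fraction of (utterance containing the keyword, utterance not containing it) pairs in which the spotter scores the first higher than the second. The sketch below only illustrates that standard definition; it is not the paper's training algorithm. The function name `pairwise_auc` and the example scores are hypothetical, chosen for illustration.

```python
from itertools import product


def pairwise_auc(pos_scores, neg_scores):
    """AUC as the fraction of (positive, negative) score pairs ranked correctly.

    pos_scores: spotter confidences for utterances that contain the keyword.
    neg_scores: spotter confidences for utterances that do not contain it.
    Tied pairs count as half a correct ranking.
    """
    if not pos_scores or not neg_scores:
        raise ValueError("need at least one positive and one negative score")
    correct = 0.0
    for p, n in product(pos_scores, neg_scores):
        if p > n:
            correct += 1.0
        elif p == n:
            correct += 0.5
    return correct / (len(pos_scores) * len(neg_scores))


# Hypothetical confidences, not taken from the paper.
pos = [0.9, 0.7, 0.4]
neg = [0.8, 0.3, 0.1]
auc = pairwise_auc(pos, neg)  # 7 of the 9 pairs are ranked correctly
```

Under this reading, a learner that pushes every positive utterance to score above every negative one for the same keyword directly raises the AUC, which is the motivation the abstract gives for its discriminative objective.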
Pages: 317-329
Number of pages: 13
Related papers
50 in total
  • [1] A survey on structured discriminative spoken keyword spotting
    Tabibian, Shima
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (04) : 2483 - 2520
  • [3] An application of recurrent neural networks to discriminative keyword spotting
    Fernandez, Santiago
    Graves, Alex
    Schmidhuber, Juergen
    [J]. ARTIFICIAL NEURAL NETWORKS - ICANN 2007, PT 2, PROCEEDINGS, 2007, 4669 : 220 - +
  • [4] Discriminative Keyword Spotting for limited-data applications
    Benisty, Hadas
    Katz, Itamar
    Crammer, Koby
    Malah, David
    [J]. SPEECH COMMUNICATION, 2018, 99 : 1 - 11
  • [5] A fast hierarchical search algorithm for discriminative keyword spotting
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    [J]. INFORMATION SCIENCES, 2016, 336 : 45 - 59
  • [6] Keyword spotting using an evolutionary-based classifier and discriminative features
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2013, 26 (07) : 1660 - 1670
  • [7] Sequence discriminative training for deep learning based acoustic keyword spotting
    Chen, Zhehuai
    Qian, Yanmin
    Yu, Kai
    [J]. SPEECH COMMUNICATION, 2018, 102 : 100 - 111
  • [8] Extension of a Kernel-Based Classifier for Discriminative Spoken Keyword Spotting
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    [J]. NEURAL PROCESSING LETTERS, 2014, 39 (02) : 195 - 218
  • [10] Discriminative keyword spotting using triphones information and N-best search
    Tabibian, Shima
    Akbari, Ahmad
    Nasersharif, Babak
    [J]. INFORMATION SCIENCES, 2018, 423 : 157 - 171