ONLINE WORD-SPOTTING IN CONTINUOUS SPEECH WITH RECURRENT NEURAL NETWORKS

Authors
Baljekar, Pallavi [1, 2]
Lehman, Jill Fain [2]
Singh, Rita [1, 2]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Disney Res, Pittsburgh, PA USA
Keywords
Continuous speech; Online word-spotting; Speech recognition; Recurrent neural networks; Gated networks
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we introduce a simplified architecture for gated recurrent neural networks that can be used in single-pass applications, where word-spotting must be done in real time and phoneme-level information is not available for training. The network operates as a self-contained block in a strictly forward-pass configuration and generates keyword labels directly. We call these simple networks causal networks: the current output is weighted only by past inputs and outputs. Because the basic network has a simpler architecture than the traditional memory networks used in keyword spotting, it also requires less data to train. Experiments on a standard speech database highlight the behavior and efficacy of such networks. Comparisons with a standard HMM-based keyword spotter show that these networks, while simple, are still more accurate.
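
The abstract does not spell out the architecture, so the following is only a rough illustration of the causal, single-pass idea, not the authors' network. It assumes a hypothetical single-gate cell (`CausalGatedCell`) whose update at each frame sees only the current frame, the previous hidden state, and the previous output, and a hypothetical helper (`spot_online`) that emits one label index per frame as audio features arrive. The weights are random and untrained, and every name, dimension, and gating formula here is an assumption made for illustration.

```python
import math
import random


def _matvec(matrix, vec):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def _softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class CausalGatedCell:
    """Hypothetical single-gate recurrent cell (an assumption, not the paper's design)."""

    def __init__(self, n_in, n_hidden, n_labels, seed=0):
        rng = random.Random(seed)
        # Context = current frame + previous hidden state + previous output.
        n_ctx = n_in + n_hidden + n_labels

        def init(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

        self.w_gate = init(n_hidden, n_ctx)   # untrained, random weights
        self.w_cand = init(n_hidden, n_ctx)
        self.w_out = init(n_labels, n_hidden)
        self.n_hidden = n_hidden
        self.n_labels = n_labels

    def initial_state(self):
        """Zero hidden state and a uniform 'previous output'."""
        return [0.0] * self.n_hidden, [1.0 / self.n_labels] * self.n_labels

    def step(self, frame, state):
        """Consume one frame; nothing from the future is ever read."""
        hidden, prev_out = state
        ctx = list(frame) + hidden + prev_out
        gate = [1.0 / (1.0 + math.exp(-a)) for a in _matvec(self.w_gate, ctx)]
        cand = [math.tanh(a) for a in _matvec(self.w_cand, ctx)]
        # The gate blends the old state with the new candidate state.
        new_hidden = [g * h + (1.0 - g) * c for g, h, c in zip(gate, hidden, cand)]
        out = _softmax(_matvec(self.w_out, new_hidden))
        return out, (new_hidden, out)


def spot_online(cell, frames):
    """Yield one label index per frame in a single forward pass."""
    state = cell.initial_state()
    for frame in frames:
        out, state = cell.step(frame, state)
        yield max(range(len(out)), key=out.__getitem__)
```

Because each step reads only past state, changing a later frame cannot alter labels already emitted for earlier frames, which is the property that makes this kind of network usable in a real-time, single-pass setting.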
Pages: 536-541
Page count: 6