ONLINE WORD-SPOTTING IN CONTINUOUS SPEECH WITH RECURRENT NEURAL NETWORKS

被引:0
|
作者
Baljekar, Pallavi [1 ,2 ]
Lehman, Jill Fain [2 ]
Singh, Rita [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Disney Res, Pittsburgh, PA USA
关键词
Continuous speech; Online word-spotting; Speech recognition; Recurrent neural networks; Gated networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce a simplified architecture for gated recurrent neural networks that can be used in single-pass applications, where word-spotting needs to be done in real-time and phoneme-level information is not available for training. The network operates as a self-contained block in a strictly forward-pass configuration to directly generate keyword labels. We call these simple networks causal networks, where the current output is only weighted by the the past inputs and outputs. Since the basic network has a simpler architecture as compared to traditional memory networks used in keyword spotting, it also requires less data to train. Experiments on a standard speech database highlight the behavior and efficacy of such networks. Comparisons with a standard HMM-based keyword spotter show that these networks, while simple, are still more accurate.
引用
收藏
页码:536 / 541
页数:6
相关论文
共 50 条
  • [41] Variational Recurrent Neural Networks for Speech Separation
    Chien, Jen-Tzung
    Kuo, Kuan-Ting
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1193 - 1197
  • [42] Speech recognition with hierarchical recurrent neural networks
    Natl Chiao Tung Univ, Hsinchu, Taiwan
    Pattern Recognit, 6 (795-805):
  • [43] Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting
    Li, Nan
    Chen, Jinying
    Cao, Huaigu
    Zhang, Bing
    Natarajan, Prem
    2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 134 - 139
  • [44] Online text prediction with recurrent neural networks
    Pérez-Ortiz, JA
    Calera-Rubio, J
    Forcada, ML
    NEURAL PROCESSING LETTERS, 2001, 14 (02) : 127 - 140
  • [45] Inaccessibility in online learning of recurrent neural networks
    Saito, A
    Taiji, M
    Ikegami, T
    PHYSICAL REVIEW LETTERS, 2004, 93 (16) : 168101 - 1
  • [46] Online Text Prediction with Recurrent Neural Networks
    Juan Antonio Pérez-Ortiz
    Jorge Calera-Rubio
    Mikel L. Forcada
    Neural Processing Letters, 2001, 14 : 127 - 140
  • [47] Nonlinear enhancement of noisy speech, using continuous attractor dynamics formed in recurrent neural networks
    Dehyadegary, Louiza
    Seyyedsalehi, Seyyed Ali
    Nejadgholi, Isar
    NEUROCOMPUTING, 2011, 74 (17) : 2716 - 2724
  • [48] NEURAL NETWORKS FOR STATISTICAL RECOGNITION OF CONTINUOUS SPEECH
    MORGAN, N
    BOURLARD, HA
    PROCEEDINGS OF THE IEEE, 1995, 83 (05) : 742 - 770
  • [49] Continuous speech recognition by convolutional neural networks
    Zhang, Qing-Qing
    Liu, Yong
    Pan, Jie-Lin
    Yan, Yong-Hong
    Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2015, 37 (09): : 1212 - 1217
  • [50] The role of the syllable in lexical segmentation in French: Word-spotting data (vol 81, pg 144, 2002)
    Dumay, N
    Frauenfelder, UH
    Content, A
    BRAIN AND LANGUAGE, 2002, 83 (02) : 362 - 363