ONLINE WORD-SPOTTING IN CONTINUOUS SPEECH WITH RECURRENT NEURAL NETWORKS

被引:0
|
作者
Baljekar, Pallavi [1 ,2 ]
Lehman, Jill Fain [2 ]
Singh, Rita [1 ,2 ]
机构
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] Disney Res, Pittsburgh, PA USA
关键词
Continuous speech; Online word-spotting; Speech recognition; Recurrent neural networks; Gated networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce a simplified architecture for gated recurrent neural networks that can be used in single-pass applications, where word-spotting needs to be done in real-time and phoneme-level information is not available for training. The network operates as a self-contained block in a strictly forward-pass configuration to directly generate keyword labels. We call these simple networks causal networks, where the current output is only weighted by the the past inputs and outputs. Since the basic network has a simpler architecture as compared to traditional memory networks used in keyword spotting, it also requires less data to train. Experiments on a standard speech database highlight the behavior and efficacy of such networks. Comparisons with a standard HMM-based keyword spotter show that these networks, while simple, are still more accurate.
引用
收藏
页码:536 / 541
页数:6
相关论文
共 50 条
  • [21] Syllabic parsing in children: a developmental study using visual word-spotting in Spanish
    Alvarez, Carlos J.
    Garcia-Saavedra, Guacimara
    Luque, Juan L.
    Taft, Marcus
    JOURNAL OF CHILD LANGUAGE, 2017, 44 (02) : 380 - 401
  • [22] RECURRENT NEURAL NETWORKS FOR SPEECH RECOGNITION
    VERDEJO, JED
    HERREROS, AP
    LUNA, JCS
    ORTUZAR, MCB
    AYUSO, AR
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 540 : 361 - 369
  • [23] An Interactive Transcription System of Census Records using Word-Spotting based Information Transfer
    Mas, Joan
    Fornes, Alicia
    Llados, Josep
    PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 54 - 59
  • [24] Effects of prompt style on user responses to an automated banking service using word-spotting
    McInnes, F.R.
    Nairn, I.A.
    Attwater, D.J.
    Jack, M.A.
    British Telecom technology journal, 1999, 17 (01): : 160 - 171
  • [25] Effects of prompt style on user responses to an automated banking service using word-spotting
    McInnes, FR
    Nairn, IA
    Attwater, DJ
    Jack, MA
    BT TECHNOLOGY JOURNAL, 1999, 17 (01) : 160 - 171
  • [26] Graph Convolutional Neural Networks for Learning Attribute Representations for Word Spotting
    Wolf, Fabian
    Fischer, Andreas
    Fink, Gernot A.
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT I, 2021, 12821 : 50 - 64
  • [27] Spotting consonant-vowel units in continuous speech using autoassociative neural networks and support vector machines
    Gangashetty, SV
    Sekhar, CC
    Yegnanarayana, B
    MACHINE LEARNING FOR SIGNAL PROCESSING XIV, 2004, : 401 - 410
  • [28] OCR-independent and Segmentation-free Word-Spotting in Handwritten Arabic Archive Documents
    Aouadi, N.
    Kacem, A.
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND SOFTWARE APPLICATIONS (ICEESA), 2013, : 36 - 41
  • [29] Recurrent Neural Networks for Word Alignment Model
    Tamura, Akihiro
    Watanabe, Taro
    Sumita, Eiichiro
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, : 1470 - 1480
  • [30] A Novel Word-Spotting Method for Handwritten Documents Using an Optimization-Based Classifier
    Tavoli, Reza
    Keyvanpour, Mohammadreza
    APPLIED ARTIFICIAL INTELLIGENCE, 2017, 31 (04) : 346 - 375