ONLINE WORD-SPOTTING IN CONTINUOUS SPEECH WITH RECURRENT NEURAL NETWORKS

被引：0

作者：

Baljekar, Pallavi ^{[1
,2
]}

Lehman, Jill Fain ^{[2
]}

Singh, Rita ^{[1
,2
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Disney Res, Pittsburgh, PA USA

来源：

2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014 | 2014年

关键词：

Continuous speech; Online word-spotting; Speech recognition; Recurrent neural networks; Gated networks;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper we introduce a simplified architecture for gated recurrent neural networks that can be used in single-pass applications, where word-spotting needs to be done in real-time and phoneme-level information is not available for training. The network operates as a self-contained block in a strictly forward-pass configuration to directly generate keyword labels. We call these simple networks causal networks, where the current output is only weighted by the the past inputs and outputs. Since the basic network has a simpler architecture as compared to traditional memory networks used in keyword spotting, it also requires less data to train. Experiments on a standard speech database highlight the behavior and efficacy of such networks. Comparisons with a standard HMM-based keyword spotter show that these networks, while simple, are still more accurate.

引用

页码：536 / 541

页数：6

共 50 条

[1] A RECORD WORD-SPOTTING MECHANISM
Heacock, R. H.
JOURNAL OF THE SOCIETY OF MOTION PICTURE ENGINEERS, 1937, 28 (01): : 63 - 72
[2] A Novel Word Spotting Method Based on Recurrent Neural Networks
Frinken, Volkmar
Fischer, Andreas
Manmatha, R.
Bunke, Horst
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 211 - 224
[3] An Automatic Word-spotting Framework for Medieval Manuscripts
Pintus, Ruggero
Yang, Ying
Gobbetti, Enrico
Rushmeier, Holly
2015 DIGITAL HERITAGE INTERNATIONAL CONGRESS, VOL 2: ANALYSIS & INTERPRETATION THEORY, METHODOLOGIES, PRESERVATION & STANDARDS DIGITAL HERITAGE PROJECTS & APPLICATIONS, 2015, : 5 - 12
[4] A Classification-free Word-Spotting System
Vassilopoulos, Nikos
Kavallieratou, Ergina
DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
[5] Automatic Synthesis of Historical Arabic Text for Word-Spotting
Kassis, Majeed
El-Sana, Jihad
PROCEEDINGS OF 12TH IAPR WORKSHOP ON DOCUMENT ANALYSIS SYSTEMS, (DAS 2016), 2016, : 239 - 244
[6] Visual keyword based word-spotting in handwritten documents
Kolcz, A
Alspector, J
Augusteijn, M
Carlson, R
Popescu, GV
DOCUMENT RECOGNITION V, 1998, 3305 : 185 - 193
[7] Word Spotting in Continuous Speech Using Wavelet Transform
Khan, Wasiq
Jiang, Ping
Holton, Rob
2014 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2014, : 275 - 279
[8] Word-spotting based on inter-word and intra-word diphone models
Nitta, T
Tanaka, S
Masai, Y
Matsuura, H
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1093 - 1096
[9] Unsupervised writer adaptation of whole-word HMMs with application to word-spotting
Rodriguez-Serrano, Jose A.
Perronnin, Florent
Sanchez, Gemma
Llados, Josep
PATTERN RECOGNITION LETTERS, 2010, 31 (08) : 742 - 749
[10] Fast implementation methods for Viterbi-based word-spotting
Knill, KM
Young, SJ
1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 522 - 525

← 1 2 3 4 5 →