A GRAMMAR COMPILER FOR CONNECTED SPEECH RECOGNITION

Cited by: 4
Authors
BROWN, MK
WILPON, JG
Affiliation
[1] AT&T Bell Laboratories, Murray Hill, NJ 07974
DOI
10.1109/78.80761
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Subject classification codes
0808; 0809
Abstract
It is well known that syntactic constraints, when applied to speech recognition, greatly improve accuracy. However, until recently, constructing an efficient grammar specification for use by a connected word speech recognizer was performed by hand and has been a tedious, time-consuming task prone to error. For this reason, very large grammars have not appeared. We describe a compiler for constructing optimized syntactic digraphs from easily written grammar specifications. These are written in a language called grammar specification language (GSL). The compiler has a preprocessing (macroexpansion) phase, a parse phase, graph code generation and compilation phases, and three optimization phases. Digraphs can also be linked together by a graph linker to form larger digraphs. Language complexity is analyzed in a statistics phase. Heretofore, computer generated digraphs were often filled with redundancies. Larger graphs were constructed and optimized by hand in order to achieve the required efficiency. We demonstrate that the optimization phase yields graphs with even greater efficiency than previously achieved by hand. We also discuss some preliminary speech recognition results of applying these techniques to intermediate and large graphs. With the introduction of these tools it is now possible to provide a speech recognition user with the ability to define new task grammars in the field. GSL has been used by several untutored users with good success. Experience with GSL indicates that it is a viable medium for quickly and accurately defining grammars for use in connected speech recognition systems.
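Illustrative note: the abstract does not give GSL syntax or the details of the three optimization phases, so the following is only a minimal sketch of the general idea of removing redundancy from a word digraph. It assumes the grammar is supplied as a plain list of sentences (not GSL), builds a naive tree-shaped digraph, and then merges states that accept the same continuations. The function names `build_trie`, `merge_equivalent_states`, and `sentences_of` are hypothetical, and the merging step is a generic technique for acyclic graphs, not the authors' algorithm.

```python
# Sketch only: redundancy removal in an acyclic word digraph.
# Input format and all names are assumptions, not taken from the paper.

def build_trie(sentences):
    """Build a naive digraph: edges[state] = {word: next_state}."""
    edges = {0: {}}
    finals = set()
    for sentence in sentences:
        state = 0
        for word in sentence.split():
            if word not in edges[state]:
                new_state = len(edges)
                edges[new_state] = {}
                edges[state][word] = new_state
            state = edges[state][word]
        finals.add(state)
    return edges, finals


def merge_equivalent_states(edges, finals, start=0):
    """Merge states whose outgoing continuations are identical."""
    signature_to_id = {}
    canonical = {}

    def canon(state):
        if state in canonical:
            return canonical[state]
        # A state's signature is its finality plus its canonicalized arcs.
        arcs = tuple(sorted((w, canon(t)) for w, t in edges[state].items()))
        sig = (state in finals, arcs)
        canonical[state] = signature_to_id.setdefault(sig, len(signature_to_id))
        return canonical[state]

    new_edges, new_finals = {}, set()
    for state, out in edges.items():
        merged = canon(state)
        new_edges[merged] = {w: canon(t) for w, t in out.items()}
        if state in finals:
            new_finals.add(merged)
    return new_edges, new_finals, canon(start)


def sentences_of(edges, finals, start=0):
    """Enumerate every sentence accepted by an acyclic digraph."""
    found = []

    def walk(state, words):
        if state in finals:
            found.append(" ".join(words))
        for word, target in sorted(edges[state].items()):
            walk(target, words + [word])

    walk(start, [])
    return sorted(found)


SENTENCES = ["call home now", "call office now",
             "dial home now", "dial office now"]

edges, finals = build_trie(SENTENCES)
min_edges, min_finals, min_start = merge_equivalent_states(edges, finals)

print(len(edges), len(min_edges))  # 11 4
```

In this toy case the naive graph has 11 states and the merged graph has 4, while both accept exactly the same four sentences. Real task grammars contain loops and optional paths, which this acyclic sketch does not handle.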
Pages: 17-28
Page count: 12