A GRAMMAR COMPILER FOR CONNECTED SPEECH RECOGNITION

Cited by: 4
Authors
BROWN, MK
WILPON, JG
Affiliation
[1] AT&T Bell Laboratories, Murray Hill, NJ 07974
DOI
10.1109/78.80761
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract
It is well known that syntactic constraints, when applied to speech recognition, greatly improve accuracy. However, until recently, constructing an efficient grammar specification for use by a connected word speech recognizer was performed by hand and has been a tedious, time-consuming task prone to error. For this reason, very large grammars have not appeared. We describe a compiler for constructing optimized syntactic digraphs from easily written grammar specifications. These are written in a language called grammar specification language (GSL). The compiler has a preprocessing (macroexpansion) phase, a parse phase, graph code generation and compilation phases, and three optimization phases. Digraphs can also be linked together by a graph linker to form larger digraphs. Language complexity is analyzed in a statistics phase. Heretofore, computer-generated digraphs were often filled with redundancies. Larger graphs were constructed and optimized by hand in order to achieve the required efficiency. We demonstrate that the optimization phase yields graphs with even greater efficiency than previously achieved by hand. We also discuss some preliminary speech recognition results of applying these techniques to intermediate and large graphs. With the introduction of these tools it is now possible to provide a speech recognition user with the ability to define new task grammars in the field. GSL has been used by several untutored users with good success. Experience with GSL indicates that it is a viable medium for quickly and accurately defining grammars for use in connected speech recognition systems.
Pages: 17-28
Page count: 12