Integrated recognition of words and prosodic phrase boundaries

被引:20
|
作者
Gallwitz, F [1 ]
Niemann, H [1 ]
Nöth, E [1 ]
Warnke, V [1 ]
机构
[1] Univ Erlangen Nurnberg, Chair Pattern Recognit, D-91058 Erlangen, Germany
关键词
speech recognition; prosody; speech understanding;
D O I
10.1016/S0167-6393(01)00027-9
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we present an integrated approach for recognizing both the word sequence and the syntactic-prosodic structure of a spontaneous utterance. The approach aims at improving the performance of the understanding component of speech understanding systems by exploiting not only acoustic-phonetic and syntactic information, but also prosodic information directly within the speech recognition process. Whereas spoken utterances are typically modelled as unstructured word sequences in the speech recognizer, our approach includes phrase boundary information in the language model and provides HMMs to model the acoustic and prosodic characteristics of phrase boundaries. This methodology has two major advantages compared to purely word-based speech recognizers. First, additional syntactic-prosodic boundaries are determined by the speech recognizer which facilitates parsing and resolve syntactic and semantic ambiguities. Second - after having removed the boundary information from the result of the recognizer - the integrated model yields a 4% relative word error rate (WER) reduction compared to a traditional word recognizer. The boundary classification performance is equal to that of a separate prosodic classifier operating on the word recognizer output, thus making a separate classifier unnecessary for this task and saving the computation time involved. Compared to the baseline word recognizer, the integrated word-and-boundary recognizer does not involve any computational overhead. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:81 / 95
页数:15
相关论文
共 50 条
  • [41] Nonlocal effects of prosodic boundaries
    Carlson, Katy
    Clifton, Charles, Jr.
    Frazier, Lyn
    MEMORY & COGNITION, 2009, 37 (07) : 1014 - 1025
  • [42] Phrase Lengths and the Perceived Informativeness of Prosodic Cues in Turkish
    Deniz, Nazik Dinctopal
    Fodor, Janet Dean
    LANGUAGE AND SPEECH, 2017, 60 (04) : 505 - 529
  • [43] An integrative approach of accentuation relationships in the prosodic phrase in French
    Di Cristo, Albert
    JOURNAL OF FRENCH LANGUAGE STUDIES, 2011, 21 (01) : 73 - 95
  • [44] Prosodic cues for automatic phrase boundary detection in ASR
    Klara, Vicsi
    Gyorgy, Szaszak
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2006, 4188 : 547 - +
  • [45] Prosodic Boundaries and Prosodic Word in Chengdu Dialect: a Durational Perspective
    Qin, Zuxuan
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 595 - 598
  • [46] The development of prosodic structure in early words
    Prieto, P
    JOURNAL OF LINGUISTICS, 2005, 41 (01) : 214 - 218
  • [47] Recursive prosodic words in Kaqchikel (Mayan)
    Bennett, Ryan
    GLOSSA-A JOURNAL OF GENERAL LINGUISTICS, 2018, 3 (01):
  • [48] Prosodic marking of syntactic boundaries in Khoekhoe
    Tulchynska, Kira
    Job, Sylvanus
    Witzlack-Makarevich, Alena
    Zellers, Margaret
    INTERSPEECH 2024, 2024, : 3684 - 3688
  • [49] Effects of prosodic boundaries on syntactic disambiguation
    Kang, S
    Speer, SR
    STUDIA LINGUISTICA, 2005, 59 (2-3) : 244 - 258
  • [50] Prosodic markers at syntactic boundaries in Spanish
    Payeras, J
    PROCEEDINGS OF THE TENTH JOURNEES DE LINGUISTIQUE (1996), 1996, B-20 : 139 - 143