A CORPUS-BASED STUDY OF REPAIR CUES IN SPONTANEOUS SPEECH

被引:73
|
作者
NAKATANI, CH [1 ]
HIRSCHBERG, J [1 ]
机构
[1] AT&T BELL LABS,MURRAY HILL,NJ 07974
来源
关键词
D O I
10.1121/1.408547
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The occurrence of disfluencies in fully natural speech poses difficult challenges for spoken language understanding systems. For example, although self-repairs occur in about 10% of spontaneous utterances, they are often unmodeled in speech recognition systems. This is partly due to the fact that little is known about the extent to which cues in the speech signal may facilitate automatic repair processing. In this paper, acoustic and prosodic cues to self-repairs are identified, based on an analysis of a corpus taken from the ARPA Air Travel Information System database, and methods are proposed for exploiting these cues for repair detection, especially the task of modeling word fragments, and repair correction. The relative contributions of these speech-based cues, as well as other text-based repair cues, are examined in a statistical model of repair site detection that achieves a precision rate of 91% and recall of 86% on a prosodically labeled corpus of repair utterances.
引用
收藏
页码:1603 / 1616
页数:14
相关论文
共 50 条
  • [41] A Corpus-based Study on the Collocation of Run
    靳红玉
    海外英语, 2010, (12) : 100 - 101
  • [42] A Corpus-based Study on the Use of Pretty
    Jung, Yeonchang
    LINGUISTIC RESEARCH, 2011, 28 (02) : 329 - 354
  • [43] A Corpus-Based Study of Counterfactuals in Mandarin
    Yong, Qian
    LANGUAGE AND LINGUISTICS, 2016, 17 (06) : 891 - 915
  • [44] OBITUARIES IN TRANSLATION: A CORPUS-BASED STUDY
    Rebechi, Rozane Rodrigues
    CADERNOS DE TRADUCAO, 2018, 38 (03): : 298 - 318
  • [45] Corpus-Based Study on Backchannel "Yes"
    Xia Pengzheng
    Zhang Xiaoyu
    PROCEEDINGS OF THE FOURTH NORTHEAST ASIA INTERNATIONAL SYMPOSIUM ON LANGUAGE, LITERATURE AND TRANSLATION, 2015, 2015, : 145 - 151
  • [46] A Corpus-based Analysis of Mixed Code in Hong Kong Speech
    Lee, John
    2012 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2012), 2012, : 165 - 168
  • [47] Corpus-based Malay Text-to-Speech Synthesis System
    Swee, Tan Tian
    Salleh, Sheikh Hussain Shaikh
    2008 14TH ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS, (APCC), VOLS 1 AND 2, 2008, : 52 - 56
  • [48] A corpus-based study of speech-act report verbs as a feature of translators' style
    Winters, Marion
    META, 2007, 52 (03) : 412 - 425
  • [49] Speech Database Reduction Method for Corpus-Based TTS System
    Isogai, Mitsuaki
    Mizuno, Hideyuki
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 158 - 161
  • [50] A Corpus-Based Approach to Speech Enhancement From Nonstationary Noise
    Ming, Ji
    Srinivasan, Ramji
    Crookes, Danny
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 822 - 836