Recognizing disfluencies in conversational speech

被引:21
|
作者
Lease, Matthew [1 ]
Johnson, Mark
Charniak, Eugene
机构
[1] Brown Univ, BLLIP, Dept Comp Sci, Providence, RI 02912 USA
[2] Brown Univ, BLLIP, Dept Cognit & Linguist Sci, Providence, RI 02912 USA
基金
美国国家科学基金会;
关键词
disfluency modeling; natural language processing; rich transcription; speech processing;
D O I
10.1109/TASL.2006.878269
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a system for modeling disfluency in conversational speech: repairs, fillers, and self-interruption points (IPs). For each sentence, candidate repair analyses are generated by a stochastic tree adjoining grammar (TAG) noisy-channel model. A probabilistic syntactic language model scores the fluency of each analysis, and a maximum-entropy model selects the most likely analysis given the language model score and other features. Fillers are detected independently via a small set of deterministic rules, and IN are detected by combining the output of repair and filler detection modules. In the recent Rich Transcription Fall 2004 (RT-04F) blind evaluation, systems competed to detect these three forms of disfluency under two input conditions: a best-case scenario of manually transcribed words and a fully automatic case of automatic speech recognition (ASR) output. For all three tasks and on both types of input, our system was the top performer in the evaluation.
引用
收藏
页码:1566 / 1573
页数:8
相关论文
共 50 条
  • [41] Statistical measurements on conversational speech
    Dunn, HK
    White, SD
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1940, 11 (03): : 278 - 288
  • [42] On the analysis of speech and disfluencies for automatic detection of Mild Cognitive Impairment
    K. López-de-Ipiña
    U. Martinez-de-Lizarduy
    P. M. Calvo
    B. Beitia
    J. García-Melero
    E. Fernández
    M. Ecay-Torres
    M. Faundez-Zanuy
    P. Sanz
    [J]. Neural Computing and Applications, 2020, 32 : 15761 - 15769
  • [43] CONTROL OF MACHINES BY CONVERSATIONAL SPEECH
    CHAPMAN, WD
    BEETLE, DH
    [J]. DESIGN NEWS, 1971, 26 (07) : 125 - &
  • [44] ECHOIC CONTROL IN CONVERSATIONAL SPEECH
    BOE, R
    WINOKUR, S
    [J]. JOURNAL OF GENERAL PSYCHOLOGY, 1978, 99 (02): : 299 - 304
  • [45] Hybridizing Conversational and Clear Speech
    Kusumoto, Akiko
    Kain, Alexander B.
    Hosom, John-Paul
    van Santen, Jan R. H.
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 161 - 164
  • [46] A COMPARISON OF CONVERSATIONAL AND AUDIENCE SPEECH
    Voelker, Charles H.
    [J]. JOURNAL OF SPEECH DISORDERS, 1938, 3 (04): : 234 - 234
  • [47] Conversational telephone speech recognition
    Gauvain, JL
    Lamel, L
    Schwenk, H
    Adda, G
    Chen, L
    Lefèvre, F
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 212 - 215
  • [48] USE OF PROFANITY IN CONVERSATIONAL SPEECH
    NERBONNE, GP
    HIPSKIND, NM
    [J]. JOURNAL OF COMMUNICATION DISORDERS, 1972, 5 (01) : 47 - 50
  • [49] CONTROL OF MACHINES BY CONVERSATIONAL SPEECH
    CHAPMAN, WD
    BEETLE, DH
    [J]. MECHANICAL ENGINEERING, 1971, 93 (07): : 45 - &
  • [50] PERIODIC RHYTHMS IN CONVERSATIONAL SPEECH
    WARNER, RM
    [J]. LANGUAGE AND SPEECH, 1979, 22 (OCT-) : 381 - 396