Modeling disfluencies in conversational speech

被引:0
|
作者
Siu, M
Ostendorf, M
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Conversational speech is notably different from read speech in several ways, particularly in the presence of disfluencies but also in the frequent use of a small set of words that mark the flow of the discourse. Disfluencies are sometimes viewed as a ''problem'' in language modeling, where most previous work has focused on written text. In this paper, we take the view that disfluencies provide information themselves. in particular, we give evidence that filled pauses serve different functions, including marking linguistic unit and restart boundaries, and signaling hesitation where the speaker wants to hold the floor. The different functions can be connected to similar functions of other words common in spontaneous but not written speech, and the particular function affects the word conditioning choices in a variable ngram model. Thus, at least some of the idiosyncrasies of spontaneous speech can be viewed as a source of information for language modeling rather than an interruption in the linguistic structure.
引用
收藏
页码:386 / 389
页数:4
相关论文
共 50 条
  • [1] Recognizing disfluencies in conversational speech
    Lease, Matthew
    Johnson, Mark
    Charniak, Eugene
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1566 - 1573
  • [2] Micro-Structure of Disfluencies: Basics for Conversational Speech Synthesis
    Betz, Simon
    Wagner, Petra
    Schlangen, David
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2222 - 2226
  • [3] Statistical language modeling for speech disfluencies
    Stolcke, A
    Shriberg, E
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 405 - 408
  • [4] SPEECH DISFLUENCIES MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS
    Vasilisa, Verkhodanova O.
    Alexey, Karpov A.
    [J]. TOMSK STATE UNIVERSITY JOURNAL, 2012, (363): : 10 - +
  • [5] Disfluencies in cluttered speech
    Myers, Florence L.
    Bakker, Klaas
    St Louis, Kenneth O.
    Raphael, Lawrence J.
    [J]. JOURNAL OF FLUENCY DISORDERS, 2012, 37 (01) : 9 - 19
  • [6] Latent Prosodic Modeling (LPM) for Speech with Applications in Recognizing Spontaneous Mandarin Speech with Disfluencies
    Lin, Che-Kuang
    Lee, Lin-Shan
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2390 - 2393
  • [7] Disfluencies in the speech of intoxicated speakers
    Schiel, Florian
    Heinrich, Christian
    [J]. INTERNATIONAL JOURNAL OF SPEECH LANGUAGE AND THE LAW, 2015, 22 (01) : 19 - 33
  • [8] Modeling of VoIP conversational speech in noisy environment
    Pragtong, Padungkrit
    Erke, Tapio J.
    Ahmed, Kazi M.
    [J]. TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 2226 - +
  • [9] LOCI OF DISFLUENCIES IN SPEECH OF STUTTERERS
    SILVERMAN, FH
    WILLIAMS, DE
    [J]. PERCEPTUAL AND MOTOR SKILLS, 1967, 24 (3P2) : 1085 - +
  • [10] VARIATIONS IN NORMAL SPEECH DISFLUENCIES
    BROEN, PA
    SIEGEL, GM
    [J]. LANGUAGE AND SPEECH, 1972, 15 (JUL-S) : 219 - 231