Micro-Structure of Disfluencies: Basics for Conversational Speech Synthesis

被引:0
|
作者
Betz, Simon [1 ,2 ]
Wagner, Petra [1 ]
Schlangen, David [2 ]
机构
[1] Univ Bielefeld, Phonet & Phonol Workgrp, Bielefeld, Germany
[2] Univ Bielefeld, Dialogue Syst Grp, Bielefeld, Germany
关键词
speech synthesis; disfluencies; spontaneous speech; dialogue systems; incrementality;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Incremental dialogue systems can produce fast responses and can interact in a human-like fashion. However, these systems occasionally produce erroneous material or run out of things to say. Humans in such situations use disfluencies to remedy their ongoing production and signal this to the listener. We devised a new model for inserting disfluencies into synthesis and evaluated this approach in a perception test. It showed that lengthenings and silent pauses can be built for speech synthesis with low effort and high output quality. Synthesized word fragments and filled pauses, while potentially useful in incremental dialogue systems, appear more difficult to handle for listeners. While we were able to get consistently high ratings for certain types of disfluencies, the need for more basic research on their micro structure became apparent in order to be able to synthesize the fine phonetic detail of disfluencies. For this, we analysed corpus data with regard to distributional and durational aspects of lengthenings, word fragments and pauses. Based on these natural speaking strategies, we explored further to what extent speech can be delayed using disfluency strategies, and how to handle difficult disfluency elements by determining the appropriate amount of durational variation applicable.
引用
收藏
页码:2222 / 2226
页数:5
相关论文
共 50 条
  • [1] Modeling disfluencies in conversational speech
    Siu, M
    Ostendorf, M
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 386 - 389
  • [2] Recognizing disfluencies in conversational speech
    Lease, Matthew
    Johnson, Mark
    Charniak, Eugene
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1566 - 1573
  • [3] The micro-structure of attention
    Taylor, Neill R.
    Hartley, Matthew
    Taylor, John G.
    [J]. NEURAL NETWORKS, 2006, 19 (09) : 1347 - 1370
  • [4] OPTICAL INVESTIGATION OF THE MICRO-STRUCTURE
    BROOKS, A
    [J]. METALLURGIA, 1984, 51 (03): : 104 - 105
  • [5] MICRO-STRUCTURE IN LINEAR ELASTICITY
    MINDLIN, RD
    [J]. ARCHIVE FOR RATIONAL MECHANICS AND ANALYSIS, 1964, 16 (01) : 51 - 78
  • [6] THERMOELASTICITY OF BODIES WITH MICRO-STRUCTURE
    WOZNIAK, C
    [J]. ARCHIWUM MECHANIKI STOSOWANEJ, 1967, 19 (03): : 335 - &
  • [7] The micro-structure of a sunspot penumbra
    Sanchez Almeida, J.
    [J]. Solar Polarization 4, 2006, 358 : 13 - 18
  • [8] The Micro-structure of Use of Help
    Novick, David G.
    Andrade, Oscar D.
    Bean, Nathaniel
    [J]. SIGDOC'09: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON DESIGN OF COMMUNICATION, 2009, : 97 - 104
  • [9] Volumetric texture synthesis of bone micro-structure as a base for scaffold design
    Holdstein, Y.
    Fischer, A.
    Podshivalov, L.
    Bar-Yoseph, P. Z.
    [J]. SMI 2009: IEEE INTERNATIONAL CONFERENCE ON SHAPE MODELING AND APPLICATIONS, PROCEEDINGS, 2009, : 81 - 88
  • [10] INVERTING POLYURETHANES SYNTHESIS: EFFECTS ON NANO/MICRO-STRUCTURE AND MECHANICAL PROPERTIES
    Fernandez-d'Arlas, B.
    Rueda, L.
    Fernandez, R.
    Khan, U.
    Coleman, J. N.
    Mondragon, I.
    Eceiza, A.
    [J]. SOFT MATERIALS, 2011, 9 (01) : 79 - 93