Evaluating Prosodic Processing for Incremental Speech Synthesis

被引:0
|
作者
Baumann, Timo [1 ]
Schlangen, David [1 ]
机构
[1] Univ Hamburg, Dept Informat, Hamburg, Germany
关键词
speech synthesis; spoken dialogue systems; incrementality; prosody;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incremental speech synthesis (iSS) accepts input and produces output in consecutive chunks that only together result in a full utterance. Systems that use iSS thus have the ability to adapt their utterances while they are ongoing. However, starting to process with less than the full utterance available prohibits global optimization, leading to potentially suboptimal solutions. In this paper, we present a method for incrementalizing the symbolic pre-processing component of speech synthesis and assess the influence of varying "lookahead", i. e. knowledge about the rest of the utterance, on prosodic quality. We found that high quality incremental output can be achieved even with a lookahead of less than one phrase, allowing for timely system reaction.
引用
收藏
页码:438 / 441
页数:4
相关论文
共 50 条
  • [1] Prosodic Processing for the Automatic Synthesis of Emotional Russian Speech
    Kaliyev, Arman
    Matveev, Yuri N.
    Lyakso, Elena E.
    Rybin, Sergey V.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE QUALITY MANAGEMENT, TRANSPORT AND INFORMATION SECURITY, INFORMATION TECHNOLOGIES (IT&QM&IS), 2018, : 653 - 655
  • [2] The incremental processing of focus, givenness and prosodic prominence
    Baumann, Stefan
    Schumacher, Petra B.
    [J]. GLOSSA-A JOURNAL OF GENERAL LINGUISTICS, 2020, 5 (01):
  • [3] Prosodic and syntactic speech processing and their interplay
    Eckstein, K
    Friederici, AD
    [J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 2005, : 94 - 94
  • [4] Speech and prosodic processing for assistive technology
    Narupiyakul, Lalita
    Keselj, Vlado
    Cercone, Nick
    Sirinaovakul, Booncharoen
    [J]. Frontiers in Artificial Intelligence and Applications, 2013, 253 : 36 - 48
  • [5] PROSODIC MODELING IN SWEDISH SPEECH SYNTHESIS
    BRUCE, G
    GRANSTROM, B
    [J]. SPEECH COMMUNICATION, 1993, 13 (1-2) : 63 - 73
  • [6] Learning prosodic patterns for mandarin speech synthesis
    Chen, YQ
    Gao, W
    Zhu, TS
    Ling, C
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2002, 19 (01) : 95 - 109
  • [7] Learning Prosodic Patterns for Mandarin Speech Synthesis
    Yiqiang Chen
    Wen Gao
    Tingshao Zhu
    Charles Ling
    [J]. Journal of Intelligent Information Systems, 2002, 19 : 95 - 109
  • [8] SPEECH SYNTHESIS USING SEGMENTAL AND PROSODIC PHONEMES
    MANDURAH, MM
    [J]. JOURNAL OF ENGINEERING SCIENCES, 1985, 11 (01): : 79 - 90
  • [9] Prediction of abstract prosodic labels for speech synthesis
    Ross, K
    Ostendorf, M
    [J]. COMPUTER SPEECH AND LANGUAGE, 1996, 10 (03): : 155 - 185
  • [10] Stress and prosodic constituency in French: issues in phonology and speech processing
    Astesano, Corine
    Bertrand, Roxane
    [J]. LANGUE FRANCAISE, 2016, (191): : 11 - +