Evaluating Prosodic Processing for Incremental Speech Synthesis

被引：0

作者：

Baumann, Timo ^{[1
]}

Schlangen, David ^{[1
]}

机构：

[1] Univ Hamburg, Dept Informat, Hamburg, Germany

来源：

13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年

关键词：

speech synthesis; spoken dialogue systems; incrementality; prosody;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Incremental speech synthesis (iSS) accepts input and produces output in consecutive chunks that only together result in a full utterance. Systems that use iSS thus have the ability to adapt their utterances while they are ongoing. However, starting to process with less than the full utterance available prohibits global optimization, leading to potentially suboptimal solutions. In this paper, we present a method for incrementalizing the symbolic pre-processing component of speech synthesis and assess the influence of varying "lookahead", i. e. knowledge about the rest of the utterance, on prosodic quality. We found that high quality incremental output can be achieved even with a lookahead of less than one phrase, allowing for timely system reaction.

引用

页码：438 / 441

页数：4

共 50 条

[1] Prosodic Processing for the Automatic Synthesis of Emotional Russian Speech
Kaliyev, Arman
Matveev, Yuri N.
Lyakso, Elena E.
Rybin, Sergey V.
[J]. 2018 IEEE INTERNATIONAL CONFERENCE QUALITY MANAGEMENT, TRANSPORT AND INFORMATION SECURITY, INFORMATION TECHNOLOGIES (IT&QM&IS), 2018, : 653 - 655
[2] The incremental processing of focus, givenness and prosodic prominence
Baumann, Stefan
Schumacher, Petra B.
[J]. GLOSSA-A JOURNAL OF GENERAL LINGUISTICS, 2020, 5 (01):
[3] Prosodic and syntactic speech processing and their interplay
Eckstein, K
Friederici, AD
[J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 2005, : 94 - 94
[4] Speech and prosodic processing for assistive technology
Narupiyakul, Lalita
Keselj, Vlado
Cercone, Nick
Sirinaovakul, Booncharoen
[J]. Frontiers in Artificial Intelligence and Applications, 2013, 253 : 36 - 48
[5] PROSODIC MODELING IN SWEDISH SPEECH SYNTHESIS
BRUCE, G
GRANSTROM, B
[J]. SPEECH COMMUNICATION, 1993, 13 (1-2) : 63 - 73
[6] Learning prosodic patterns for mandarin speech synthesis
Chen, YQ
Gao, W
Zhu, TS
Ling, C
[J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2002, 19 (01) : 95 - 109
[7] Learning Prosodic Patterns for Mandarin Speech Synthesis
Yiqiang Chen
Wen Gao
Tingshao Zhu
Charles Ling
[J]. Journal of Intelligent Information Systems, 2002, 19 : 95 - 109
[8] SPEECH SYNTHESIS USING SEGMENTAL AND PROSODIC PHONEMES
MANDURAH, MM
[J]. JOURNAL OF ENGINEERING SCIENCES, 1985, 11 (01): : 79 - 90
[9] Prediction of abstract prosodic labels for speech synthesis
Ross, K
Ostendorf, M
[J]. COMPUTER SPEECH AND LANGUAGE, 1996, 10 (03): : 155 - 185
[10] Stress and prosodic constituency in French: issues in phonology and speech processing
Astesano, Corine
Bertrand, Roxane
[J]. LANGUE FRANCAISE, 2016, (191): : 11 - +

← 1 2 3 4 5 →