An embedded English synthesis approach based on speech concatenation and smoothing

被引：0

作者：

Chen, GL ^{[1
]}

Yue, DJ ^{[1
]}

Zu, YQ ^{[1
]}

Yu, ZL ^{[1
]}

机构：

[1] Motorola Labs, China Res Ctr, Shanghai, Peoples R China

来源：

2004 International Symposium on Chinese Spoken Language Processing, Proceedings | 2004年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An embedded English synthesis approach based on speech concatenation and smoothing is described. This approach adopts phonetic sub-words as carrier of variable-length units. We define 5-class units to cover all English phonetic phenomena. The corresponding cost function and search procedure based on dynamic programming are addressed in the unit-selection stage. Vocal tract response, pitch value and phase are interpolated and merged at concatenating points for smoothing speech in the synthesis stage. The preliminary test shows that this approach can reach a good balance of naturalness, intelligibility and data footprint.

引用

页码：157 / 160

页数：4

共 50 条

[1] A comparison of spectral smoothing methods for segment concatenation based speech synthesis
Chappell, DT
Hansen, JHL
[J]. SPEECH COMMUNICATION, 2002, 36 (3-4) : 343 - 374
[2] Speech Processing for Arabic Speech Synthesis Based on Concatenation Rules
Imedjdouben F.
[J]. SN Computer Science, 5 (3)
[3] A Wavelet Based Concatenation Algorithm for Gujarati Speech Synthesis
Gujarathi, Priyanka Vishwas
Patil, Sandip Raosaheb
[J]. HELIX, 2020, 10 (05): : 38 - 43
[4] Realistic Visual Speech Synthesis Based on Hybrid Concatenation Method
Tao, Jianhua
Xin, Le
Yin, Panrong
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (03): : 469 - 477
[5] Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis
Hertz, SR
[J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 87 - 90
[6] AlpSynth - Concatenation-based speech synthesis for the Slovenian language
Gros, JZ
Mihelic, A
Pavesic, N
Zganec, M
Gruden, S
[J]. Proceedings ELMAR-2005, 2005, : 213 - 216
[7] Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis
Sakai, Shinsuke
Kawahara, Tatsuya
Kawai, Hisashi
[J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2006 - 2014
[8] CONCATENATION RULES FOR DEMISYLLABLE SPEECH SYNTHESIS
DETTWEILER, H
HESS, W
[J]. ACUSTICA, 1985, 57 (4-5): : 268 - 283
[9] SYNTHESIS OF ENGLISH MONOSYLLABLES BY DEMISYLLABLE CONCATENATION
LOVINS, JB
FUJIMURA, O
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S75 - S75
[10] Accurate Visual Speech Synthesis Based on Diviseme Unit Selection and Concatenation
Jiang, Dongmei
Ravyse, Ilse
Sahli, Hichem
Zhang, Yanning
[J]. 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 910 - +

← 1 2 3 4 5 →