An embedded English synthesis approach based on speech concatenation and smoothing

被引:0
|
作者
Chen, GL [1 ]
Yue, DJ [1 ]
Zu, YQ [1 ]
Yu, ZL [1 ]
机构
[1] Motorola Labs, China Res Ctr, Shanghai, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An embedded English synthesis approach based on speech concatenation and smoothing is described. This approach adopts phonetic sub-words as carrier of variable-length units. We define 5-class units to cover all English phonetic phenomena. The corresponding cost function and search procedure based on dynamic programming are addressed in the unit-selection stage. Vocal tract response, pitch value and phase are interpolated and merged at concatenating points for smoothing speech in the synthesis stage. The preliminary test shows that this approach can reach a good balance of naturalness, intelligibility and data footprint.
引用
收藏
页码:157 / 160
页数:4
相关论文
共 50 条
  • [1] A comparison of spectral smoothing methods for segment concatenation based speech synthesis
    Chappell, DT
    Hansen, JHL
    [J]. SPEECH COMMUNICATION, 2002, 36 (3-4) : 343 - 374
  • [2] Speech Processing for Arabic Speech Synthesis Based on Concatenation Rules
    Imedjdouben F.
    [J]. SN Computer Science, 5 (3)
  • [3] A Wavelet Based Concatenation Algorithm for Gujarati Speech Synthesis
    Gujarathi, Priyanka Vishwas
    Patil, Sandip Raosaheb
    [J]. HELIX, 2020, 10 (05): : 38 - 43
  • [4] Realistic Visual Speech Synthesis Based on Hybrid Concatenation Method
    Tao, Jianhua
    Xin, Le
    Yin, Panrong
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (03): : 469 - 477
  • [5] Integration of rule-based formant synthesis and waveform concatenation: A hybrid approach to text-to-speech synthesis
    Hertz, SR
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 87 - 90
  • [6] AlpSynth - Concatenation-based speech synthesis for the Slovenian language
    Gros, JZ
    Mihelic, A
    Pavesic, N
    Zganec, M
    Gruden, S
    [J]. Proceedings ELMAR-2005, 2005, : 213 - 216
  • [7] Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis
    Sakai, Shinsuke
    Kawahara, Tatsuya
    Kawai, Hisashi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2006 - 2014
  • [8] CONCATENATION RULES FOR DEMISYLLABLE SPEECH SYNTHESIS
    DETTWEILER, H
    HESS, W
    [J]. ACUSTICA, 1985, 57 (4-5): : 268 - 283
  • [9] SYNTHESIS OF ENGLISH MONOSYLLABLES BY DEMISYLLABLE CONCATENATION
    LOVINS, JB
    FUJIMURA, O
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 : S75 - S75
  • [10] Accurate Visual Speech Synthesis Based on Diviseme Unit Selection and Concatenation
    Jiang, Dongmei
    Ravyse, Ilse
    Sahli, Hichem
    Zhang, Yanning
    [J]. 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 910 - +