A new spectral smoothing algorithm for unit concatenating speech synthesis

Cited by: 0
Authors
Kim, SJ
Jang, KA
Han, HB
Hahn, M
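Institutions
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes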
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Concatenating speech units drawn from a large database is presently the most popular method for speech synthesis. In this approach, mismatches at the unit boundaries are unavoidable and are one cause of quality degradation. This paper proposes an algorithm that reduces undesired discontinuities between consecutive units. Optimal matching points are calculated in two steps: first, the Kullback-Leibler distance is used for spectral matching; then unit sliding and overlap windowing are used for waveform matching. The proposed algorithm is implemented in a corpus-based, unit-concatenating Korean text-to-speech system with an automatically labeled database. Experimental results show that the algorithm performs better than either raw concatenation or the overlap smoothing method.
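The two-step matching described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the abstract gives no formulas, so the symmetric form of the Kullback-Leibler distance, the cross-correlation criterion for unit sliding, the linear crossfade window, and all function and parameter names below are assumptions chosen for illustration only.

```python
import math


def kl_distance(p, q, eps=1e-12):
    """Symmetric Kullback-Leibler distance between two positive power spectra.

    Assumption: spectra are normalized to sum to 1 and compared with
    KL(p||q) + KL(q||p); the paper may define the measure differently.
    """
    sp, sq = sum(p), sum(q)
    p = [max(x / sp, eps) for x in p]
    q = [max(x / sq, eps) for x in q]
    return sum((a - b) * math.log(a / b) for a, b in zip(p, q))


def best_spectral_match(left_frames, right_frames):
    """Step 1 (spectral matching): pick the frame pair (i, j) with the
    smallest distance among candidate boundary frames of the two units."""
    pairs = ((i, j) for i in range(len(left_frames))
             for j in range(len(right_frames)))
    return min(pairs,
               key=lambda ij: kl_distance(left_frames[ij[0]],
                                          right_frames[ij[1]]))


def best_slide(left_tail, right_head, max_shift):
    """Step 2a (unit sliding): choose the shift of the right unit that
    maximizes cross-correlation with the tail of the left unit.
    The correlation criterion is a hypothetical stand-in."""
    def corr(shift):
        return sum(a * b for a, b in zip(left_tail, right_head[shift:]))
    return max(range(max_shift + 1), key=corr)


def overlap_add(left, right, overlap):
    """Step 2b (overlap windowing): crossfade the last `overlap` samples of
    `left` into the first `overlap` samples of `right` with a linear window."""
    if overlap == 0:
        return list(left) + list(right)
    out = list(left[:-overlap])
    start = len(left) - overlap
    for n in range(overlap):
        w = (n + 1) / (overlap + 1)  # fade-in weight for the right unit
        out.append((1 - w) * left[start + n] + w * right[n])
    out.extend(right[overlap:])
    return out
```

In such a sketch, `best_spectral_match` would select where to cut each unit, `best_slide` would fine-tune the alignment of the waveforms around that cut, and `overlap_add` would join them. A real system would work on windowed FFT frames and would likely restrict the search to a small region around each labeled boundary.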
Pages: 550-556
Number of pages: 7