A new spectral smoothing algorithm for unit concatenating speech synthesis

被引:0
|
作者
Kim, SJ
Jang, KA
Han, HB
Hahn, M
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech unit concatenation with a large database is presently the most popular method for speech synthesis. In this approach, the mismatches at the unit boundaries are unavoidable and become one of the reasons for quality degradation. This paper proposes an algorithm to reduce undesired discontinuities between the subsequent units. Optimal matching points are calculated in two steps. Firstly, the Kullback-Leibler distance measurement is utilized for the spectral matching, then the unit sliding and the overlap windowing are used for the waveform matching. The proposed algorithm is implemented for the corpus-based unit concatenating Korean text-to-speech system that has an automatically labeled database. Experimental results show that our algorithm is fairly better than the raw concatenation or the overlap smoothing method.
引用
收藏
页码:550 / 556
页数:7
相关论文
共 50 条
  • [1] Smoothing algorithm for contextual phone concatenation in speech synthesis
    Yin, Yong
    Cao, Zhenhai
    Zu, Yiqing
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2008, 48 (SUPPL.): : 640 - 644
  • [2] SPECTRAL SMOOTHING TECHNIQUE IN PARCOR SPEECH ANALYSIS-SYNTHESIS
    TOHKURA, Y
    ITAKURA, F
    HASHIMOTO, S
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (06): : 587 - 596
  • [3] A New Algorithm for Adaptive Smoothing of Signals in Speech Enhancement
    Sunny, Sonia
    Peter, David S.
    Jacob, K. Poulose
    [J]. 2013 INTERNATIONAL CONFERENCE ON ELECTRONIC ENGINEERING AND COMPUTER SCIENCE (EECS 2013), 2013, 4 : 337 - 343
  • [4] A comparison of spectral smoothing methods for segment concatenation based speech synthesis
    Chappell, DT
    Hansen, JHL
    [J]. SPEECH COMMUNICATION, 2002, 36 (3-4) : 343 - 374
  • [5] On the Role of Spectral Dynamics in Unit Selection Speech Synthesis
    Kirkpatrick, Barry
    O'Brien, Darragh
    Scaife, Ronan
    Errity, Andrew
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2029 - 2032
  • [6] Synthesis of unseen context and spectral and pitch contour smoothing in concatenated text to speech synthesis
    Low, PH
    Vaseghi, S
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 469 - 472
  • [7] Improvement of synthetic speech quality using a new spectral smoothing technique
    Jang, H
    Choi, M
    Lee, K
    Kim, G
    Choi, H
    [J]. CISST '05: Proceedings of the 2005 International Conference on Imaging Science, Systems, and Technology: Computer Graphics, 2005, : 271 - 277
  • [8] Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis
    Vepa, Jithendra
    King, Simon
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1763 - 1771
  • [9] PREDICTING SPECTRAL AND PROSODIC PARAMETERS FOR UNIT SELECTION IN SPEECH SYNTHESIS
    Dong, Minghui
    Li, Haizhou
    [J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 133 - 136
  • [10] Speech Smoothing Algorithm Based On MFCC
    Wang, Jiali
    Lin, Xueyuan
    Zhang, Yazhou
    Liang, Famai
    [J]. ICMS2010: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, VOL 1: ENGINEERING COMPUTATION AND FINITE ELEMENT ANALYSIS, 2010, : 286 - 289