A new spectral smoothing algorithm for unit concatenating speech synthesis

被引:0
|
作者
Kim, SJ
Jang, KA
Han, HB
Hahn, M
机构
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Speech unit concatenation with a large database is presently the most popular method for speech synthesis. In this approach, the mismatches at the unit boundaries are unavoidable and become one of the reasons for quality degradation. This paper proposes an algorithm to reduce undesired discontinuities between the subsequent units. Optimal matching points are calculated in two steps. Firstly, the Kullback-Leibler distance measurement is utilized for the spectral matching, then the unit sliding and the overlap windowing are used for the waveform matching. The proposed algorithm is implemented for the corpus-based unit concatenating Korean text-to-speech system that has an automatically labeled database. Experimental results show that our algorithm is fairly better than the raw concatenation or the overlap smoothing method.
引用
收藏
页码:550 / 556
页数:7
相关论文
共 50 条
  • [41] A new algorithm of speech camouflage
    Niu, XX
    Yang, YX
    [J]. CHINESE JOURNAL OF ELECTRONICS, 2004, 13 (03) : 491 - 495
  • [42] Speech Enhancement Algorithm Based on Improved Spectral Subtraction
    Gao, Liuyang
    Guo, Yunfei
    Li, Shaomei
    Chen, Fucai
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTELLIGENT SYSTEMS, PROCEEDINGS, VOL 3, 2009, : 140 - 143
  • [43] An Improved Spectral Subtraction Algorithm for Speech Enhancement System
    Na, Shun
    Li, Weixing
    Liu, Yang
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INFORMATION ENGINEERING FOR MECHANICS AND MATERIALS, 2016, 97 : 318 - 323
  • [44] A recursive parametric spectral subtraction algorithm for speech enhancement
    You, Ming-Chan
    Mao, Cheng-Yi
    Wang, Jeen-Shing
    Chuang, Fang-Chen
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 826 - +
  • [45] Unit Selection Model in Arabic Speech Synthesis
    Al-Saiyd, Nedhal A.
    Hijjawi, Mohammad
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (04): : 126 - 131
  • [46] Acoustic speech unit segmentation for concatenative synthesis
    Torres, H. M.
    Gurlekian, J. A.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 196 - 206
  • [47] Syllable as the Basic Unit for Kannada Speech Synthesis
    Geeta, Sai
    Muralidhara, B. L.
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1205 - 1208
  • [48] Control of spectral dynamics in concatenative speech synthesis
    Wouters, J
    Macon, MW
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (01): : 30 - 38
  • [49] Glottal Spectral Separation for Parametric Speech Synthesis
    Cabral, Joao P.
    Renals, Steve
    Richmond, Korin
    Yamagishi, Junichi
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 1829 - 1832
  • [50] SPEECH SYNTHESIS BY DYADIC INTERPOLATION OF SPECTRAL PARAMETERS
    SHADLE, C
    ATAL, B
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1977, 62 : S62 - S62