A new spectral smoothing algorithm for unit concatenating speech synthesis

被引：0

作者：

Kim, SJ

Jang, KA

Han, HB

Hahn, M

机构：

来源：

AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2005年 / 3809卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speech unit concatenation with a large database is presently the most popular method for speech synthesis. In this approach, the mismatches at the unit boundaries are unavoidable and become one of the reasons for quality degradation. This paper proposes an algorithm to reduce undesired discontinuities between the subsequent units. Optimal matching points are calculated in two steps. Firstly, the Kullback-Leibler distance measurement is utilized for the spectral matching, then the unit sliding and the overlap windowing are used for the waveform matching. The proposed algorithm is implemented for the corpus-based unit concatenating Korean text-to-speech system that has an automatically labeled database. Experimental results show that our algorithm is fairly better than the raw concatenation or the overlap smoothing method.

引用

页码：550 / 556

页数：7

共 50 条

[1] Smoothing algorithm for contextual phone concatenation in speech synthesis
Yin, Yong
Cao, Zhenhai
Zu, Yiqing
[J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2008, 48 (SUPPL.): : 640 - 644
[2] SPECTRAL SMOOTHING TECHNIQUE IN PARCOR SPEECH ANALYSIS-SYNTHESIS
TOHKURA, Y
ITAKURA, F
HASHIMOTO, S
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1978, 26 (06): : 587 - 596
[3] A New Algorithm for Adaptive Smoothing of Signals in Speech Enhancement
Sunny, Sonia
Peter, David S.
Jacob, K. Poulose
[J]. 2013 INTERNATIONAL CONFERENCE ON ELECTRONIC ENGINEERING AND COMPUTER SCIENCE (EECS 2013), 2013, 4 : 337 - 343
[4] A comparison of spectral smoothing methods for segment concatenation based speech synthesis
Chappell, DT
Hansen, JHL
[J]. SPEECH COMMUNICATION, 2002, 36 (3-4) : 343 - 374
[5] On the Role of Spectral Dynamics in Unit Selection Speech Synthesis
Kirkpatrick, Barry
O'Brien, Darragh
Scaife, Ronan
Errity, Andrew
[J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2029 - 2032
[6] Synthesis of unseen context and spectral and pitch contour smoothing in concatenated text to speech synthesis
Low, PH
Vaseghi, S
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 469 - 472
[7] Improvement of synthetic speech quality using a new spectral smoothing technique
Jang, H
Choi, M
Lee, K
Kim, G
Choi, H
[J]. CISST '05: Proceedings of the 2005 International Conference on Imaging Science, Systems, and Technology: Computer Graphics, 2005, : 271 - 277
[8] Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis
Vepa, Jithendra
King, Simon
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (05): : 1763 - 1771
[9] PREDICTING SPECTRAL AND PROSODIC PARAMETERS FOR UNIT SELECTION IN SPEECH SYNTHESIS
Dong, Minghui
Li, Haizhou
[J]. 2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 133 - 136
[10] Speech Smoothing Algorithm Based On MFCC
Wang, Jiali
Lin, Xueyuan
Zhang, Yazhou
Liang, Famai
[J]. ICMS2010: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON MODELLING AND SIMULATION, VOL 1: ENGINEERING COMPUTATION AND FINITE ELEMENT ANALYSIS, 2010, : 286 - 289

← 1 2 3 4 5 →