A new spectral smoothing algorithm for unit concatenating speech synthesis

Cited by: 0

Authors:

Kim, SJ

Jang, KA

Han, HB

Hahn, M

Source:

AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE | 2005, Vol. 3809

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Speech unit concatenation with a large database is presently the most popular method for speech synthesis. In this approach, mismatches at unit boundaries are unavoidable and are one cause of quality degradation. This paper proposes an algorithm to reduce undesired discontinuities between consecutive units. Optimal matching points are calculated in two steps: first, the Kullback-Leibler distance is used for spectral matching; then unit sliding and overlap windowing are used for waveform matching. The proposed algorithm is implemented in a corpus-based, unit-concatenating Korean text-to-speech system that has an automatically labeled database. Experimental results show that the proposed algorithm performs better than raw concatenation or the overlap smoothing method.
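The two-step matching described in the abstract can be illustrated with a rough sketch. This record gives no implementation details, so everything below is an assumption made for illustration rather than the authors' method: the symmetric form of the Kullback-Leibler distance on normalized power spectra, normalized correlation as the sliding criterion, a raised-cosine crossfade as the overlap window, and all function names and parameter values (`frame_len`, `hop`, `overlap`, `max_shift`).

```python
import numpy as np

def frame_spectrum(frame, n_fft=512, eps=1e-10):
    """Power spectrum of one frame, normalized to sum to 1 (assumed feature)."""
    windowed = frame * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed, n_fft)) ** 2 + eps
    return power / power.sum()

def symmetric_kl(p, q):
    """Symmetric Kullback-Leibler distance between two normalized spectra."""
    return float(np.sum((p - q) * np.log(p / q)))

def best_spectral_match(left, right, frame_len=256, hop=64, n_candidates=4):
    """Step 1 (assumed form): search a few frames near the boundary for the
    pair with the smallest KL distance.
    Returns (distance, cut point in left, cut point in right)."""
    best = (np.inf, len(left), 0)
    for i in range(n_candidates):
        end = len(left) - i * hop
        if end < frame_len:
            break
        p = frame_spectrum(left[end - frame_len:end])
        for j in range(n_candidates):
            start = j * hop
            if start + frame_len > len(right):
                break
            q = frame_spectrum(right[start:start + frame_len])
            d = symmetric_kl(p, q)
            if d < best[0]:
                best = (d, end, start)
    return best

def slide_and_crossfade(left, right, overlap=128, max_shift=80):
    """Step 2 (assumed form): slide the right unit to maximize normalized
    correlation with the tail of the left unit, then crossfade the overlap.
    Returns (joined signal, chosen shift)."""
    tail = left[-overlap:]
    best_shift, best_score = 0, -np.inf
    for s in range(max_shift + 1):
        head = right[s:s + overlap]
        if len(head) < overlap:
            break
        denom = np.linalg.norm(tail) * np.linalg.norm(head) + 1e-12
        score = np.dot(tail, head) / denom
        if score > best_score:
            best_shift, best_score = s, score
    head = right[best_shift:best_shift + overlap]
    # Raised-cosine fades that sum to 1 at every sample of the overlap.
    fade_out = 0.5 * (1 + np.cos(np.pi * np.arange(overlap) / overlap))
    mixed = tail * fade_out + head * (1 - fade_out)
    joined = np.concatenate([left[:-overlap], mixed, right[best_shift + overlap:]])
    return joined, best_shift

def concatenate_units(left, right):
    """Chain both steps: spectral cut points first, then waveform alignment."""
    _, cut_l, cut_r = best_spectral_match(left, right)
    joined, _ = slide_and_crossfade(left[:cut_l], right[cut_r:])
    return joined
```

Calling `concatenate_units(left, right)` on two one-dimensional NumPy arrays of samples returns a single joined waveform. Raw concatenation, one of the baselines named in the abstract, would correspond to `np.concatenate([left, right])`.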

Pages: 550-556

Page count: 7
