Feature Extraction for Spectral Continuity Measures in Concatenative Speech Synthesis

Cited by: 0
Authors
Kirkpatrick, Barry [1]
O'Brien, Darragh [1]
Scaife, Ronan [1]
Affiliations
[1] Dublin City Univ, Fac Engn & Comp, Dublin 9, Ireland
Keywords
speech synthesis; unit selection; join cost; wavelet transform; phase spectra
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The quality of concatenative speech synthesis depends on the cost function employed for unit selection. Effective cost functions for spectral continuity are difficult to define, and standard measures often do not accurately reflect human perception of discontinuity across a concatenated join. In this study, the performance of a number of standard distance measures is compared for the task of detecting audible discontinuities in concatenated speech. Feature sets derived from the phase spectrum are also investigated. Feature extraction based on wavelet analysis is proposed to overcome some of the limitations of the standard measures tested. Receiver Operating Characteristic (ROC) curves are constructed for each measure from the results of a perceptual experiment and are used to rank the performance of each measure. Results indicate that phase spectra are comparable to magnitude spectra as a join cost for spectral continuity. Measures based on wavelet transform coefficients outperform all other measures tested.
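The ROC-based ranking described in the abstract can be illustrated with a minimal sketch. It assumes each join has a scalar distance from some measure and a binary perceptual label (audible discontinuity or not). The function names `roc_points` and `auc`, and the use of area under the curve as the ranking criterion, are assumptions made for illustration; the abstract does not state how the curves were compared.

```python
from typing import List, Sequence, Tuple


def roc_points(distances: Sequence[float],
               audible: Sequence[bool]) -> List[Tuple[float, float]]:
    """Sweep a decision threshold over the distances, from high to low.

    A join is predicted "discontinuous" when its distance is at or above
    the threshold. Returns (false positive rate, true positive rate) points.
    """
    pairs = sorted(zip(distances, audible), key=lambda p: -p[0])
    n_pos = sum(1 for _, label in pairs if label)
    n_neg = len(pairs) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one audible and one inaudible join")

    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(pairs):
        threshold = pairs[i][0]
        # Joins with identical distances cross the threshold together.
        while i < len(pairs) and pairs[i][0] == threshold:
            if pairs[i][1]:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / n_neg, tp / n_pos))
    return points


def auc(points: Sequence[Tuple[float, float]]) -> float:
    """Area under the ROC curve by the trapezoidal rule."""
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))
```

Under this assumption, a measure whose distances separate audible from inaudible joins perfectly has an area of 1.0, and a measure no better than chance has an area near 0.5, so candidate measures can be ordered by this value.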
Pages: 1742-1745
Number of pages: 4
Related papers
50 in total
  • [11] Forward masking phenomenon in concatenative speech synthesis
    Cernak, M
    Rozinaj, G
    [J]. PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 691 - 694
  • [12] Automatic Labeling Schemes for Concatenative Speech Synthesis
    Kacur, Juraj
    Cepko, Jozef
    Palenik, Andrej
    [J]. PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642
  • [13] A Concatenative Synthesis Based Speech Synthesiser for Hindi
    Gupta, Kshitij
    [J]. ADVANCES IN COMPUTER AND INFORMATION SCIENCES AND ENGINEERING, 2008, : 261 - 264
  • [14] Sinusoidal plus all-pole modification based spectral smoothing for concatenative speech synthesis
    Kang, H
    Liu, WJ
    [J]. Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 194 - 198
  • [15] Acoustic speech unit segmentation for concatenative synthesis
    Torres, H. M.
    Gurlekian, J. A.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2008, 22 (02): : 196 - 206
  • [16] Nonlinear speech features for the objective detection of discontinuities in concatenative speech synthesis
    Pantazis, Y
    Stylianou, Y
    [J]. NONLINEAR SPEECH MODELING AND APPLICATIONS, 2005, 3445 : 375 - 383
  • [17] Context-adaptive smoothing for concatenative speech synthesis
    Lee, KS
    Kim, SR
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (12) : 422 - 425
  • [18] Speech synthesis for text-to-speech alignment and prosodic feature extraction
    Malfrere, F
    Dutoit, T
    [J]. ISCAS '97 - PROCEEDINGS OF 1997 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I - IV: CIRCUITS AND SYSTEMS IN THE INFORMATION AGE, 1997, : 2637 - 2640
  • [19] The phase substitutions in Czech harmonic concatenative speech synthesis
    Tychtl, Z
    Matous, K
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2003, 2807 : 333 - 340
  • [20] Lipschitz continuity of spectral measures
    Ricker, WJ
    [J]. BULLETIN OF THE AUSTRALIAN MATHEMATICAL SOCIETY, 1999, 59 (03) : 369 - 373