An auditory-based distortion measure with application to concatenative speech synthesis

被引:13
|
作者
Hansen, JHL [1 ]
Chappell, DT [1 ]
机构
[1] Duke Univ, Dept Elect & Comp Engn, Robust Speech Proc Lab, Durham, NC 27708 USA
来源
关键词
auditory system; distance measurements; spectral analysis; speech synthesis;
D O I
10.1109/89.709674
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study presents a new auditory-based distance measure with application to concatenative speech synthesis. This measure employs the Carney auditory model to produce a feature vector related to auditory perception. For concatenative synthesis, the new measure is employed to assess perceived discontinuities at segment transitions. Evaluations using a restricted data base environment show that the new measure can be effective in improving speech synthesis performance.
引用
收藏
页码:489 / 495
页数:7
相关论文
共 50 条
  • [31] On the relevance of auditory-based Gabor features for deep learning in robust speech recognition
    [J]. Castro Martinez, Angel Mario (angel.castro@uni-oldenburg.de), 1600, Academic Press (45):
  • [32] Allophone-based concatenative speech synthesis system for Russian
    Skrelin, PA
    [J]. TEXT, SPEECH AND DIALOGUE, 1999, 1692 : 156 - 159
  • [33] Discriminative training for concatenative speech synthesis
    Kim, NS
    Park, SS
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (01) : 40 - 43
  • [34] LSM-based boundary training for concatenative speech synthesis
    Bellegarda, Jerome R.
    [J]. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 721 - 724
  • [35] On the relevance. of auditory-based Gabor features for deep learning in robust speech recognition
    Martinez, Angel Mario Castro
    Mallidi, Sri Harish
    Meyer, Bernd T.
    [J]. COMPUTER SPEECH AND LANGUAGE, 2017, 45 : 21 - 38
  • [36] Speech unit selection based on target values driven by speech data in concatenative speech synthesis
    Hirai, T
    Tenpaku, S
    Shikano, K
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 43 - 46
  • [37] Monaural Auditory-Based Unvoiced Speech Segregation Using SNR-Based Subband Spectral Subtraction
    Geravanchizadeh, Masoud
    Dadvar, Paria
    [J]. ACTA ACUSTICA UNITED WITH ACUSTICA, 2014, 100 (02) : 353 - 361
  • [38] Forward masking phenomenon in concatenative speech synthesis
    Cernak, M
    Rozinaj, G
    [J]. PROCEEDINGS EC-VIP-MC 2003, VOLS 1 AND 2, 2003, : 691 - 694
  • [39] A Flexible Architecture for Urdu Phonemes-Based Concatenative Speech Synthesis
    Ahmad, Muhammad Rizwan
    Arshad, Muhammad Junaid
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2016, 35 (03) : 373 - 380
  • [40] Automatic Labeling Schemes for Concatenative Speech Synthesis
    Kacur, Juraj
    Cepko, Jozef
    Palenik, Andrej
    [J]. PROCEEDINGS ELMAR-2008, VOLS 1 AND 2, 2008, : 639 - 642