An auditory-based distortion measure with application to concatenative speech synthesis

被引:13
|
作者
Hansen, JHL [1 ]
Chappell, DT [1 ]
机构
[1] Duke Univ, Dept Elect & Comp Engn, Robust Speech Proc Lab, Durham, NC 27708 USA
来源
关键词
auditory system; distance measurements; spectral analysis; speech synthesis;
D O I
10.1109/89.709674
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study presents a new auditory-based distance measure with application to concatenative speech synthesis. This measure employs the Carney auditory model to produce a feature vector related to auditory perception. For concatenative synthesis, the new measure is employed to assess perceived discontinuities at segment transitions. Evaluations using a restricted data base environment show that the new measure can be effective in improving speech synthesis performance.
引用
收藏
页码:489 / 495
页数:7
相关论文
共 50 条
  • [11] A Concatenative Synthesis Based Speech Synthesiser for Hindi
    Gupta, Kshitij
    [J]. ADVANCES IN COMPUTER AND INFORMATIOM SCIENCES AND ENGINEERING, 2008, : 261 - 264
  • [12] Auditory-based speech processing based on the average localized synchrony detection
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 1623 - 1626
  • [13] Syllable Based Concatenative Synthesis for Text to Speech Conversion
    Ananthi, S.
    Dhanalakshmi, P.
    [J]. COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 3, 2015, 33
  • [14] Robust classification of stop consonants using auditory-based speech processing
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 81 - 84
  • [15] Robust Auditory-Based Speech Feature Extraction Using Independent Subspace Method
    Wu, Qiang
    Zhang, Liqing
    Xia, Bin
    [J]. ADVANCES IN COGNITIVE NEURODYNAMICS, PROCEEDINGS, 2008, : 405 - +
  • [16] DUAL-CHANNEL ITERATIVE SPEECH ENHANCEMENT WITH CONSTRAINTS ON AN AUDITORY-BASED SPECTRUM
    NANDKUMAR, S
    HANSEN, JHL
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (01): : 22 - 34
  • [17] Robust auditory-based speech processing using the average localized synchrony detection
    Ali, AMA
    Van der Spiegel, J
    Mueller, P
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (05): : 279 - 292
  • [18] Introduction to Multilingual Corpus-Based Concatenative Speech Synthesis
    Deprez, Filip
    Odijk, Jan
    De Moortel, Jan
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 357 - 360
  • [19] SET OF CONCATENATIVE UNITS FOR SPEECH SYNTHESIS
    OLIVE, J
    LIBERMAN, M
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 : S130 - S130
  • [20] AUDITORY DISTORTION MEASURE FOR SPEECH CODER EVALUATION - DISCRIMINATION INFORMATION APPROACH
    DE, A
    KABAL, P
    [J]. SPEECH COMMUNICATION, 1994, 14 (03) : 205 - 229