An auditory-based distortion measure with application to concatenative speech synthesis

被引:13
|
作者
Hansen, JHL [1 ]
Chappell, DT [1 ]
机构
[1] Duke Univ, Dept Elect & Comp Engn, Robust Speech Proc Lab, Durham, NC 27708 USA
来源
关键词
auditory system; distance measurements; spectral analysis; speech synthesis;
D O I
10.1109/89.709674
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This study presents a new auditory-based distance measure with application to concatenative speech synthesis. This measure employs the Carney auditory model to produce a feature vector related to auditory perception. For concatenative synthesis, the new measure is employed to assess perceived discontinuities at segment transitions. Evaluations using a restricted data base environment show that the new measure can be effective in improving speech synthesis performance.
引用
收藏
页码:489 / 495
页数:7
相关论文
共 50 条
  • [1] Speech distortion measure based on auditory properties
    CHEN Guo
    HU Xiulin
    ZHANG Yunyu
    ZHU Yaoting (Department of Electronics and Information Engineering
    [J]. Chinese Journal of Acoustics, 2000, (04) : 339 - 345
  • [2] SPEECH SEGMENT SELECTION FOR CONCATENATIVE SYNTHESIS BASED ON SPECTRAL DISTORTION MINIMIZATION
    IWAHASHI, N
    KAIKI, N
    SAGISAKA, Y
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1993, E76A (11) : 1942 - 1948
  • [3] Application of an Auditory-Based Feedback Distortion to Modify Gait Symmetry in Healthy Individuals
    Liu, Le Yu
    Sangani, Samir
    Patterson, Kara K.
    Fung, Joyce
    Lamontagne, Anouk
    [J]. BRAIN SCIENCES, 2024, 14 (08)
  • [4] Speech Enhancement Using Auditory-Based Transform
    Tank, Vanita Raj
    Mahajan, S. P.
    Khaparde, Arti
    Deshpande, Rahul
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
  • [5] AN AUDITORY-BASED FEATURE FOR ROBUST SPEECH RECOGNITION
    Shao, Yang
    Jin, Zhaozhang
    Wang, DeLiang
    Srinivasan, Soundararajan
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4625 - +
  • [6] A modified Itakura speech distortion measure based on auditory properties
    Chen, G
    Hu, XL
    Zhang, YY
    Zhu, YT
    [J]. APPLIED ACOUSTICS, 2001, 62 (05) : 545 - 553
  • [7] An auditory-based measure for improved phone segment concatenation
    Chappell, DT
    Hansen, JHL
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 1639 - 1642
  • [8] Discriminative auditory-based features for robust speech recognition
    Mak, BKW
    Tam, YC
    Li, PQ
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (01): : 27 - 36
  • [9] Auditory-Based Spectral Amplitude Estimators for Speech Enhancement
    Plourde, Eric
    Champagne, Benoit
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1614 - 1623
  • [10] Speech segment selection for concatenative synthesis based on prosody-aligned distance measure
    Kuo, CC
    Kuo, CS
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 473 - 476