Synthesis fidelity and time-varying spectral change in vowels

被引:36
|
作者
Assmann, PF
Katz, WF
机构
[1] Univ Texas, Sch Behav & Brain Sci, Richardson, TX 75083 USA
[2] Univ Texas, Callier Ctr Commun Disorders, Richardson, TX 75083 USA
来源
关键词
D O I
10.1121/1.1852549
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent studies have shown that synthesized versions of American English vowels are less accurately identified when the natural time-varying spectral changes are eliminated by holding the formant frequencies constant over the duration of the vowel. A limitation of these experiments has been that vowels produced by formant synthesis are generally less accurately identified than the natural vowels after which they are modeled. To overcome this limitation, a high-quality speech analysis-synthesis system (STRAIGHT) was used to synthesize versions of 12 American English vowels spoken by adults and children. Vowels synthesized with STRAIGHT were identified as accurately as the natural versions, in contrast with previous results from our laboratory showing identification rates 9 %-12 % lower for the same vowels synthesized using the cascade formant model. Consistent with earlier studies, identification accuracy was not reduced when the fundamental frequency was held constant across the vowel. However, elimination of time-varying changes in the spectral envelope using STRAIGHT led to a greater reduction in accuracy (23 %) than was previously found with cascade formant synthesis (11 %). A statistical pattern recognition model, applied to acoustic measurements of the natural and synthesized vowels, predicted both the higher identification accuracy for vowels synthesized using STRAIGHT compared to formant synthesis, and the greater effects of holding the formant frequencies constant over time with STRAIGHT synthesis. Taken together, the experiment and modeling results suggest that formant estimation errors and incorrect rendering of spectral and temporal cues by cascade formant synthesis contribute to lower identification accuracy and underestimation of the role of time-varying spectral change in vowels. (C) 2005 Acoustical Society of America.
引用
收藏
页码:886 / 895
页数:10
相关论文
共 50 条
  • [41] An HDR Spectral Imaging System for Time-Varying Omnidirectional Scene
    Hirai, Keita
    Osawa, Naoto
    Horiuchi, Takahiko
    Tominaga, Shoji
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2059 - 2064
  • [42] Time-varying interference spectral analysis for Cognitive UWB networks
    Francone, Massimo
    Domenicali, Daniele
    Di Benedetto, Maria-Gabriella
    IECON 2006 - 32ND ANNUAL CONFERENCE ON IEEE INDUSTRIAL ELECTRONICS, VOLS 1-11, 2006, : 4704 - +
  • [43] Time-varying spectral characteristics of ENSO over the Last Millennium
    Pandora Hope
    Benjamin J. Henley
    Joelle Gergis
    Josephine Brown
    Hua Ye
    Climate Dynamics, 2017, 49 : 1705 - 1727
  • [44] Estimation of Time-Varying Spectral Peaks and Decomposition of EEG Spectrograms
    Stokes, Patrick A.
    Prerau, Michael J.
    IEEE ACCESS, 2020, 8 (218257-218278) : 218257 - 218278
  • [45] Spectral characterization of feedback linear periodically time-varying systems
    Mosquera, C
    Scalise, S
    Taricco, G
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 1209 - 1212
  • [46] Time-varying complex spectral analysis via recursive APES
    Wu, R
    Liu, ZS
    Li, J
    IEE PROCEEDINGS-RADAR SONAR AND NAVIGATION, 1998, 145 (06) : 354 - 360
  • [47] Time-varying Spectral Entropy Based Analysis of Impulse Noises
    Singh, Neelima
    Lall, Brejesh
    2019 IEEE 30TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2019, : 1534 - 1539
  • [48] Time-varying spectral characteristics of ENSO over the Last Millennium
    Hope, Pandora
    Henley, Benjamin J.
    Gergis, Joelle
    Brown, Josephine
    Ye, Hua
    CLIMATE DYNAMICS, 2017, 49 (5-6) : 1705 - 1727
  • [49] Indexical properties influence time-varying amplitude and fundamental frequency contributions of vowels to sentence intelligibility
    Fogerty, Daniel
    JOURNAL OF PHONETICS, 2015, 52 : 89 - 104
  • [50] TIME-VARYING LYAPUNOV FUNCTIONS FOR LINEAR TIME-VARYING SYSTEMS
    RAMARAJAN, S
    INTERNATIONAL JOURNAL OF CONTROL, 1986, 44 (06) : 1699 - 1702