Frequency-warping in speech

Authors
Umesh, S
Cohen, L
Marinovic, N
Nelson, D
Institutions
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
In this paper we present results indicating that the formant frequencies of different speakers scale differently at different frequencies, that is, the inter-speaker scale factor is frequency dependent. Based on our experiments on speech data, we then numerically compute a universal frequency-warping function that makes the scale factor independent of frequency in the warped domain. The proposed warping function is found to be similar to the mel scale, which was previously derived from purely psychoacoustic experiments. The motivation for the present experiments stems from our recently proposed use of scale-transform-based cepstral coefficients [6] as acoustic features, since they provide better separability of vowels than mel-cepstral coefficients.
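The idea of a warping that turns speaker-dependent scaling into a frequency-independent shift can be illustrated with a minimal sketch. It is not the authors' numerically computed warping function, which this record does not reproduce. It assumes two stand-in warpings: the natural logarithm, under which a uniform scaling by a hypothetical factor `alpha` becomes a constant shift, and a commonly used mel-scale approximation, `2595 * log10(1 + f / 700)`. The function names and the value of `alpha` are illustrative assumptions.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Common mel-scale approximation (an assumption here, not taken from the paper)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def warped_shift(warp, f_hz: float, alpha: float) -> float:
    """Displacement in the warped domain when a frequency is scaled by alpha."""
    return warp(alpha * f_hz) - warp(f_hz)


if __name__ == "__main__":
    alpha = 1.2  # hypothetical inter-speaker scale factor
    for f in (250.0, 500.0, 1000.0, 2000.0, 4000.0):
        log_shift = warped_shift(math.log, f, alpha)
        mel_shift = warped_shift(hz_to_mel, f, alpha)
        print(f"{f:7.0f} Hz  log shift = {log_shift:.4f}  mel shift = {mel_shift:8.2f}")
```

Under the log warping the shift equals `log(alpha)` at every frequency. Under the mel approximation the same uniform scaling produces a shift that grows with frequency, so a mel-like warping yields a constant shift only when the underlying scale factor itself varies with frequency, which is the situation the abstract describes.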
Pages: 414 - 417
Page count: 4
Related papers
50 in total
  • [1] Frequency-warping invariant features for automatic speech recognition
    Mertins, Alfred
    Rademacher, Jan
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 5883 - 5886
  • [2] Frequency-warping and speaker-normalization
    Umesh, S
    Cohen, L
    Nelson, D
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 983 - 986
  • [3] Psychoacoustic frequency-scales versus frequency-warping in scale-cepstrum
    Umesh, S
    Cohen, L
    Marinovic, N
    Nelson, D
    WAVELET APPLICATIONS IN SIGNAL AND IMAGE PROCESSING IV, PTS 1 AND 2, 1996, 2825 : 530 - 539
  • [4] THE SHORT-TIME BEHAVIOR OF A FREQUENCY-WARPING POWER SPECTRAL ESTIMATOR
    GILCHRIST, JH
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1980, 28 (02): : 176 - 183
  • [5] A frequency-warping approach to speaker normalization (vol 6, pg 49, 1998)
    Lee, L
    Rose, RC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (02): : 195 - 195
  • [6] Reducing the dispersion error in the digital waveguide mesh using interpolation and frequency-warping techniques
    Savioja, L
    Välimäki, V
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (02): : 184 - 194
  • [7] Speech-Signal-Based Frequency Warping
    Paliwal, Kuldip
    Shannon, Benjamin
    Lyons, James
    Wojcicki, Kamil
    IEEE SIGNAL PROCESSING LETTERS, 2009, 16 (04) : 319 - 322
  • [8] TRANSLATION OF DIVERS SPEECH USING DIGITAL FREQUENCY WARPING
    ZUE, V
    OPPENHEI.A
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 50 (01): : 131 - &
  • [9] A novel frequency warping scale for speech emotion recognition
    Singh, Premjeet
    Saha, Goutam
    INTERSPEECH 2023, 2023, : 3647 - 3651
  • [10] DYNAMIC FREQUENCY WARPING FOR SPEAKER ADAPTATION IN AUTOMATIC SPEECH RECOGNITION
    PALIWAL, KK
    AINSWORTH, WA
    JOURNAL OF PHONETICS, 1985, 13 (02) : 123 - 134