Wavelet transforms for speech signal processing

被引:4
|
作者
Wang, JF [1 ]
Chen, SH
Shyuu, JS
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 701, Taiwan
[2] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
关键词
wavelet transform; pitch detection; C/V segmentation; speech recognition;
D O I
10.1080/02533839.1999.9670493
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The wavelet transform and its theory is one of the most exciting developments of the last decade. In fact, the wavelet transform has been developed independently for various different fields such as signal processing, image processing, audio and speech processing, communication, and mathematics. Due to the efficient time-frequency localization and the multiresolution characteristics of the wavelet representations, the wavelet transforms are quite suitable for processing non-stationary signals such as speech. In this paper, the wavelet transform and its theory will be first introduced, then comparisons between the wavelet transform and the classical short-time Fourier transform approach to signal analysis will be provided. In addition, applying wavelet transforms in determining pitch, and segmenting consonant / vowel (C/V) parts as well as speech recognition will be discussed in this paper.
引用
收藏
页码:549 / 560
页数:12
相关论文
共 50 条
  • [31] Standards of speech technologies and recognition with Fourier and Wavelet Transforms
    Karam, J
    [J]. MLMTA '05: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MACHINE LEARNING MODELS TECHNOLOGIES AND APPLICATIONS, 2005, : 177 - 183
  • [32] A Speech Endpoint Detection Algorithm Based on Wavelet Transforms
    Cao Yali
    La Dongsheng
    Jia Shuo
    Niu Xuefen
    [J]. 26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 3010 - 3012
  • [33] Fourier Transforms in Digital Signal Processing
    Kang, Bai
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT AND INFORMATION TECHNOLOGY, 2015, 35 : 814 - 818
  • [34] Multichannel transforms for signal/image processing
    Pitas, I
    Karasaridis, A
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 1996, 5 (10) : 1402 - 1413
  • [35] Optical Wavelet Signal Processing
    Ben-Ezra, Y.
    Lembrikov, B. I.
    [J]. ICTON: 2009 11TH INTERNATIONAL CONFERENCE ON TRANSPARENT OPTICAL NETWORKS, VOLS 1 AND 2, 2009, : 973 - 976
  • [36] SPEECH PROCESSING WITH WALSH-HADAMARD TRANSFORMS
    SHUM, FYY
    ELLIOTT, AR
    BROWN, WO
    [J]. IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1973, AU21 (03): : 174 - 179
  • [37] ADAPTED LOCAL TRIGONOMETRIC TRANSFORMS AND SPEECH PROCESSING
    WESFREID, E
    WICKERHAUSER, MV
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1993, 41 (12) : 3596 - 3600
  • [38] Performance analysis of audio signal compression based on wavelet and wavelet packet transforms
    Lim, BL
    Ying, ZL
    [J]. ICICS - PROCEEDINGS OF 1997 INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING, VOLS 1-3: THEME: TRENDS IN INFORMATION SYSTEMS ENGINEERING AND WIRELESS MULTIMEDIA COMMUNICATIONS, 1997, : 735 - 739
  • [39] Wavelet transforms for non-uniform speech recognition systems
    Janer, L
    Marti, J
    Nadeu, C
    LleidaSolano, E
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 2348 - 2351
  • [40] Processing and recognition of the thermal images using wavelet transforms
    Kosikowski, Mateusz
    Suszynski, Zbigniew
    Bednarek, Michal
    [J]. MICROELECTRONICS RELIABILITY, 2011, 51 (07) : 1271 - 1275