Low complexity scalable perceptual audio coder using an optimum wavelet packet basis representation and vector quantization

被引:0
|
作者
Sathidevi, PS [1 ]
Venkataramani, Y [1 ]
机构
[1] Natl Inst Technol, Dept Elect Engn, Kerala, India
关键词
wavelet packets; psychoacoustic model; audio compression; scalability; vector quantization;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we describe a high quality low complexity scalable audio coding scheme, using an optimum wavelet packet (WP) basis signal representation based on the time varying characteristics of the audio signal. In ISO/MPEG audio coding standards [1-3], resolution of decomposition filterbank (uniform) does not match with the resolution of psychoacoustic model (which requires more resolution and needs to be matched with the critical bands (non uniform) of cochlea). Hence MPEG coder uses a separate high resolution decomposition filterbank for,psychoacoustic model implementation, which increases the computational load of the coder. Here, we use a wavelet packet decomposition structure closely matching to the critical bands [4,5] of human auditory system, to transform the data into wavelet domain and then these wavelet packet coefficients are used to drive the psychoacoustic model directly. Hence, psychoacoustic model design is integrated with the design of decomposition filterbank. Other features of the proposed coder are scalability (can support three standard industrial sampling frequencies 11.025 kHz, 22.050 kHz and 44.1 kHz) and optimum wavelet basis selection from a predefined library of wavelet bases, by extracting seven statistical features of the audio signal to be encoded. A new Vector Quantization (VQ) scheme is also proposed here, in which the length of the code book can be varied in accordance with the psychoacoustic model requirement. Experimental results show that the proposed coder yields almost transparent quality with compression ratios in the range of 6 to 10.
引用
收藏
页码:399 / 407
页数:9
相关论文
共 39 条
  • [21] Classification of Pathological and Healthy Voice Using Perceptual Wavelet Packet Decomposition and Support Vector Machine
    Arslan, Ozkan
    [J]. 2020 MEDICAL TECHNOLOGIES CONGRESS (TIPTEKNO), 2020,
  • [22] Gait Recognition Using Wavelet Packet Silhouette Representation and Transductive Support Vector Machines
    Dadashi, Farzin
    Araabi, Babak N.
    Soltanian-Zadeh, Hamid
    [J]. PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, VOLS 1-9, 2009, : 1283 - 1287
  • [23] LOW BIT-RATE VIDEO CODING USING WAVELET VECTOR QUANTIZATION
    SAMPSON, DG
    DASILVA, EAB
    GHANBARI, M
    [J]. IEE PROCEEDINGS-VISION IMAGE AND SIGNAL PROCESSING, 1995, 142 (03): : 141 - 148
  • [24] Hybrid Low Bitrate Audio Coding Using Adaptive Gain Shape Vector Quantization
    Mehrotra, Sanjeev
    Chen, Wei-ge
    Koishida, Kazuhito
    Thumpudi, Naveen
    [J]. 2008 IEEE 10TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, VOLS 1 AND 2, 2008, : 931 - 936
  • [25] A method to compress vibration signals using wavelet packet transformation combined with sub-band vector quantization
    Weng, Hao
    Gao, Jinji
    Jiang, Zhinong
    [J]. High Technology Letters, 2013, 19 (04) : 443 - 448
  • [26] A method to compress vibration signals using wavelet packet transformation combined with sub-band vector quantization
    翁浩
    Gao Jinji
    Jiang Zhinong
    [J]. High Technology Letters, 2013, 19 (04) : 443 - 448
  • [27] Highly scalable, low-complexity image coding using zeroblocks of wavelet coefficients
    Xie, G
    Shen, H
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (06) : 762 - 770
  • [28] New results in low bitrate audio coding using a combined harmonic-wavelet representation
    Boland, S
    Deriche, M
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 351 - 354
  • [29] A scalable wideband speech codec using the wavelet packet transform based on the internet low bitrate codec
    Seto, Koji
    Ogunfunmi, Tokunbo
    [J]. COMPUTER SPEECH AND LANGUAGE, 2019, 54 : 61 - 70
  • [30] LOW BITRATE AUDIO CODING USING GENERALIZED ADAPTIVE GAIN SHAPE VECTOR QUANTIZATION ACROSS CHANNELS
    Mehrotra, Sanjeev
    Chen, Wei-ge
    Kotteri, Kishore
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 9 - 12