Low complexity scalable perceptual audio coder using an optimum wavelet packet basis representation and vector quantization

Cited by: 0
Authors
Sathidevi, PS [1]
Venkataramani, Y [1]
Affiliations
[1] Natl Inst Technol, Dept Elect Engn, Kerala, India
Keywords
wavelet packets; psychoacoustic model; audio compression; scalability; vector quantization;
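DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code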
0808 ; 0809 ;
Abstract
In this paper, we describe a high-quality, low-complexity scalable audio coding scheme that uses an optimum wavelet packet (WP) basis representation adapted to the time-varying characteristics of the audio signal. In the ISO/MPEG audio coding standards [1-3], the resolution of the (uniform) decomposition filterbank does not match the resolution of the psychoacoustic model, which requires finer resolution and must be matched to the (non-uniform) critical bands of the cochlea. Hence the MPEG coder uses a separate high-resolution decomposition filterbank for the psychoacoustic model implementation, which increases the computational load of the coder. Here, we use a wavelet packet decomposition structure that closely matches the critical bands [4,5] of the human auditory system to transform the data into the wavelet domain, and these wavelet packet coefficients then drive the psychoacoustic model directly. The psychoacoustic model design is thus integrated with the design of the decomposition filterbank. Other features of the proposed coder are scalability (it supports three standard industrial sampling frequencies: 11.025 kHz, 22.050 kHz and 44.1 kHz) and optimum wavelet basis selection from a predefined library of wavelet bases, based on seven statistical features extracted from the audio signal to be encoded. A new vector quantization (VQ) scheme is also proposed, in which the length of the codebook can be varied in accordance with the requirements of the psychoacoustic model. Experimental results show that the proposed coder yields almost transparent quality at compression ratios in the range of 6 to 10.
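The idea of a non-uniform wavelet packet tree feeding a psychoacoustic model can be illustrated with a minimal sketch. Everything here is an assumption for illustration: the Haar filter pair stands in for whichever basis the coder selects from its library, the tree shape `TREE` is hypothetical and is not the critical-band tree of the paper, and `band_energies` is only a placeholder for the quantities a psychoacoustic model would consume.

```python
import math

def haar_split(x):
    """One level of orthonormal Haar analysis: returns (low, high), each half the length of x."""
    if len(x) % 2 != 0:
        raise ValueError("input length must be even")
    s = math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) / s for i in range(0, len(x), 2)]
    high = [(x[i] - x[i + 1]) / s for i in range(0, len(x), 2)]
    return low, high

# Hypothetical non-uniform tree: None marks a leaf, a pair (low_subtree, high_subtree)
# marks a node that is split further. Only the low branch is split repeatedly,
# giving narrower bands at low frequencies, loosely in the spirit of critical bands.
TREE = (((None, None), None), None)

def wp_decompose(x, tree):
    """Recursively decompose x along the given tree and return the leaf subbands in tree order."""
    if tree is None:
        return [list(x)]
    low, high = haar_split(x)
    return wp_decompose(low, tree[0]) + wp_decompose(high, tree[1])

def band_energies(bands):
    """Energy per subband (assumed stand-in for the input to a psychoacoustic model)."""
    return [sum(v * v for v in band) for band in bands]

if __name__ == "__main__":
    frame = [math.sin(2.0 * math.pi * n / 16.0) for n in range(64)]
    bands = wp_decompose(frame, TREE)
    print([len(b) for b in bands])
    print(band_energies(bands))
```

Because the Haar split is orthonormal, the subband energies sum to the energy of the input frame, so the same coefficients can serve both the coding path and the energy estimate without a second filterbank.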
Pages: 399-407
Number of pages: 9
Related papers
39 in total
  • [1] Fixed bit rate perceptual wavelet packet audio coder
    Gunawan, TS
    Ambikairajah, E
    Epps, J
    [J]. 2004 9TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), 2004, : 235 - 239
  • [2] Perceptual audio coding using sinusoidal/optimum wavelet representation
    Sathidevi, PS
    Venkataramani, Y
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2002, 21 (05) : 511 - 524
  • [3] Perceptual Audio Coding Using Sinusoidal/Optimum Wavelet Representation
    P.S. Sathidevi
    Y. Venkataramani
    [J]. Circuits, Systems and Signal Processing, 2002, 21 : 511 - 524
  • [4] A bitstream scalable audio coder using a hybrid WLPC-wavelet representation
    Ning, D
    Deriche, M
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 417 - 420
  • [5] Audio coding using the wavelet packet transform and a combined scalar-vector quantization
    Boland, S
    Deriche, M
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 1041 - 1044
  • [6] Reduced rate ultra low delay audio coder using multistage vector quantization
    Sreenivas, T. V.
    Wabnik, Stefan
    Schuller, Gerald
    [J]. CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 2080 - +
  • [7] Complexity scalable audio coding algorithm based on wavelet packet decomposition
    He, DM
    Gao, W
    Wu, JQ
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 659 - 665
  • [8] Low complexity, low delay and scalable audio coding scheme based on a novel statistical perceptual quantization procedure
    Abad, Cesar Alonso
    Fernandez, Miguel Angel Martin
    Lopez, Carlos Alberola
    [J]. SIGMAP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2007, : 31 - +
  • [9] Compressive Sensing Based Scalable Speech Coder with Dynamic Selection of Basis and Vector Quantization
    Sankar, M. S. Arun
    Sathidevi, P. S.
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1053 - 1058
  • [10] High quality low complexity scalable wavelet audio coding
    Dobson, WK
    Yang, JJ
    Smart, KJ
    Guo, FK
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 327 - 330