Low complexity scalable perceptual audio coder using an optimum wavelet packet basis representation and vector quantization

Cited by: 0

Authors:

Sathidevi, PS [1]

Venkataramani, Y [1]

Affiliations:

[1] Natl Inst Technol, Dept Elect Engn, Kerala, India

Source:

IETE JOURNAL OF RESEARCH | 2004 / Vol. 50 / No. 06

Keywords:

wavelet packets; psychoacoustic model; audio compression; scalability; vector quantization;

DOI:

Not available

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology];

Subject classification codes:

0808 ; 0809 ;

Abstract:

In this paper, we describe a high-quality, low-complexity scalable audio coding scheme that uses an optimum wavelet packet (WP) basis representation adapted to the time-varying characteristics of the audio signal. In the ISO/MPEG audio coding standards [1-3], the resolution of the (uniform) decomposition filterbank does not match the resolution required by the psychoacoustic model, which needs finer resolution matched to the (non-uniform) critical bands of the cochlea. The MPEG coder therefore uses a separate high-resolution decomposition filterbank to implement the psychoacoustic model, which increases the computational load of the coder. Here, we use a wavelet packet decomposition structure that closely matches the critical bands [4,5] of the human auditory system to transform the data into the wavelet domain, and the resulting wavelet packet coefficients drive the psychoacoustic model directly. The psychoacoustic model design is thus integrated with the design of the decomposition filterbank. Other features of the proposed coder are scalability (it supports three standard industrial sampling frequencies: 11.025 kHz, 22.050 kHz and 44.1 kHz) and optimum wavelet basis selection from a predefined library of wavelet bases, based on seven statistical features extracted from the audio signal to be encoded. A new vector quantization (VQ) scheme is also proposed, in which the length of the codebook can be varied in accordance with the requirements of the psychoacoustic model. Experimental results show that the proposed coder yields almost transparent quality at compression ratios in the range of 6 to 10.
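
The idea of a non-uniform wavelet packet tree, with finer bands at low frequencies and coarser bands at high frequencies, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the Haar filter pair, the tree shape `TREE`, and the helper names `haar_split`, `wp_decompose` and `nominal_bandwidth_hz` are hypothetical and are not taken from the paper, which selects its basis from a predefined library and uses a tree matched to the critical bands.

```python
import math

def haar_split(x):
    """One orthonormal Haar analysis step: return (lowpass, highpass) halves."""
    s = 1.0 / math.sqrt(2.0)
    low = [(x[i] + x[i + 1]) * s for i in range(0, len(x), 2)]
    high = [(x[i] - x[i + 1]) * s for i in range(0, len(x), 2)]
    return low, high

def wp_decompose(x, tree):
    """Decompose x along a (possibly non-uniform) wavelet packet tree.

    A node is None for a leaf, or a (low_subtree, high_subtree) pair.
    Returns a list of (depth, coefficients) leaves in natural filter order,
    which is not the same as increasing-frequency order in general.
    """
    def walk(sig, node, depth):
        if node is None:
            return [(depth, sig)]
        low, high = haar_split(sig)
        return walk(low, node[0], depth + 1) + walk(high, node[1], depth + 1)
    return walk(x, tree, 0)

def nominal_bandwidth_hz(depth, fs):
    """Nominal bandwidth of a leaf at the given depth for sampling rate fs."""
    return (fs / 2.0) / (2 ** depth)

# Hypothetical tree (assumption): the lower half-band is split into four
# depth-3 leaves, the upper half-band into two depth-2 leaves.
TREE = (((None, None), (None, None)), (None, None))

if __name__ == "__main__":
    signal = [math.sin(0.3 * n) for n in range(16)]  # length divisible by 2**3
    for depth, coeffs in wp_decompose(signal, TREE):
        print(depth, len(coeffs), nominal_bandwidth_hz(depth, 44100))
```

Because each Haar step is orthonormal, the leaves together hold as many coefficients as the input and preserve its energy, so band energies computed from the leaves could in principle feed a masking model without a second filterbank. A deeper tree would be needed in practice: at 44.1 kHz, a leaf at depth 8 has a nominal bandwidth of about 86 Hz, comparable to the roughly 100 Hz critical bandwidth at low frequencies.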


Pages: 399 - 407

Number of pages: 9

39 items in total

[1] Fixed bit rate perceptual wavelet packet audio coder
Gunawan, TS
Ambikairajah, E
Epps, J
[J]. 2004 9TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), 2004, : 235 - 239
[2] Perceptual audio coding using sinusoidal/optimum wavelet representation
Sathidevi, PS
Venkataramani, Y
[J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2002, 21 (05) : 511 - 524
[3] Perceptual Audio Coding Using Sinusoidal/Optimum Wavelet Representation
P.S. Sathidevi
Y. Venkataramani
[J]. Circuits, Systems and Signal Processing, 2002, 21 : 511 - 524
[4] A bitstream scalable audio coder using a hybrid WLPC-wavelet representation
Ning, D
Deriche, M
[J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 417 - 420
[5] Audio coding using the wavelet packet transform and a combined scalar-vector quantization
Boland, S
Deriche, M
[J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 1041 - 1044
[6] Reduced rate ultra low delay audio coder using multistage vector quantization
Sreenivas, T. V.
Wabnik, Stefan
Schuller, Gerald
[J]. CONFERENCE RECORD OF THE FORTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1-5, 2007, : 2080 - +
[7] Complexity scalable audio coding algorithm based on wavelet packet decomposition
He, DM
Gao, W
Wu, JQ
[J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 659 - 665
[8] Low complexity, low delay and scalable audio coding scheme based on a novel statistical perceptual quantization procedure
Abad, Cesar Alonso
Fernandez, Miguel Angel Martin
Lopez, Carlos Alberola
[J]. SIGMAP 2007: PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2007, : 31 - +
[9] Compressive Sensing Based Scalable Speech Coder with Dynamic Selection of Basis and Vector Quantization
Sankar, M. S. Arun
Sathidevi, P. S.
[J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 1053 - 1058
[10] High quality low complexity scalable wavelet audio coding
Dobson, WK
Yang, JJ
Smart, KJ
Guo, FK
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 327 - 330
