A new quantization optimization algorithm for the MPEG advanced audio coder using a statistical subband model of the quantization noise

Cited by: 5
Authors
Derrien, O
Duhamel, P
Charbit, M
Richard, G
Affiliations
[1] ENST, TSI, Signal & Image Proc Dept, F-75634 Paris 13, France
[2] Ecole Super Elect, LSS, CNRS, Signals & Syst Lab, F-91192 Gif Sur Yvette, France
Keywords
bit-rate constraint; distortion constraint; optimization algorithm; perceptual audio coding; scale-factor; statistical model; subband quantization;
DOI
10.1109/TSA.2005.858041
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
In this paper, an improvement of the quantization optimization algorithm for the MPEG Advanced Audio Coder (AAC) is presented. Given a bit-rate constraint, this algorithm minimizes the perceived distortion generated by the signal compression. The distortion can be related to the quantization error level over frequency subbands through an auditory model. Thus, optimizing the quantization requires knowledge of the rate-distortion function for each subband. When this function can be modeled in a simple way, the algorithm can take a one-loop recursive structure. However, in the MPEG AAC, the rate-distortion function is hard to characterize, since AAC makes use of nonlinear quantizers and variable-length entropy coders. As a result, the standard algorithm makes use of two nested loops with a local decoder, in order to measure the error level rather than predict its value. We first describe a partial subband modeling of the rate-distortion function of interest in the MPEG AAC. Then, using a statistical approach, we find a relationship between the error level and the so-called quantization "scale-factor" and propose a new algorithm that is basically similar to a classical one-loop "bit allocation" process. Finally, we describe the complete algorithm and show that it is more efficient than the standard one.
Pages: 1328-1339
Number of pages: 12
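The abstract notes that the standard encoder measures the quantization error with a local decoder because the AAC quantizer is nonlinear. The following is a minimal sketch of that measurement step only, not the algorithm proposed in the paper. It assumes the commonly cited AAC power-law quantizer (exponent 3/4, rounding offset 0.4054) and a simplified step size of 2^(sf/4) with no scale-factor offset; the function names and the test signal are hypothetical.

```python
import math
import random

# Rounding offset commonly quoted for the AAC power-law quantizer (assumption).
ROUNDING_OFFSET = 0.4054


def quantize(x, sf):
    """Quantize one spectral coefficient with scale factor sf.

    Simplified step size 2**(sf / 4); the scale-factor offset used by
    real encoders is deliberately left out of this sketch.
    """
    step = 2.0 ** (sf / 4.0)
    magnitude = math.floor((abs(x) / step) ** 0.75 + ROUNDING_OFFSET)
    return int(math.copysign(magnitude, x))


def dequantize(q, sf):
    """Invert the power-law companding (what a local decoder would do)."""
    step = 2.0 ** (sf / 4.0)
    return math.copysign(abs(q) ** (4.0 / 3.0) * step, q)


def band_noise_energy(coeffs, sf):
    """Measure the mean squared quantization error over one subband."""
    total = 0.0
    for c in coeffs:
        total += (c - dequantize(quantize(c, sf), sf)) ** 2
    return total / len(coeffs)


if __name__ == "__main__":
    random.seed(0)
    # Hypothetical subband: 256 Gaussian coefficients, standard deviation 100.
    band = [random.gauss(0.0, 100.0) for _ in range(256)]
    for sf in (0, 16, 32):
        print(sf, band_noise_energy(band, sf))
```

Sweeping the scale factor in this way gives an empirical error-versus-scale-factor curve for one subband. A statistical model of that relationship, as described in the abstract, would let an encoder predict the error level instead of running the quantize/dequantize round trip inside a loop.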