Improved audio coding using a psychoacoustic model based on a cochlear filter bank

被引:16
|
作者
Baumgarte, F [1 ]
机构
[1] Agere Syst, Media Signal Proc Res Dept, Berkeley Hts, NJ 07922 USA
来源
关键词
audio coding; filter bank; masked threshold; model of masking; perceptual model;
D O I
10.1109/TSA.2002.804536
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the properties of masking. Most psychoacoustic models for coding applications use a uniform (equal bandwidth) spectral decomposition as a first step to approximate the frequency selectivity of the human auditory system. However, the equal filter properties of the uniform subbands do not match the nonuniform characteristics of cochlear filters and reduce the precision of psychoacoustic modeling. Even so, uniform filter banks are applied because they are computationally efficient. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank and a simple masked threshold estimation.. The novel filter-bank structure employs cascaded low-order HR filters and appropriate down-sampling to increase efficiency. The filter responses are. optimized for the modeling of auditory masking effects. Results of the new psychoacoustic model applied to audio coding show better performance in terms of bit rate and/or quality of the new model in comparison with other state-of-the-art models using a uniform spectral decomposition. The low delay of the new model is particularly suitable for low-delay coders.
引用
收藏
页码:495 / 503
页数:9
相关论文
共 50 条
  • [1] A psychoacoustic model for audio coding based on a cochlear filter bank
    Baumgarte, F
    [J]. PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2001, : 139 - 142
  • [2] A computationally efficient cochlear filter bank for perceptual audio coding
    Baumgarte, F
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 3265 - 3268
  • [3] Perceptual filter design for audio coding using psychoacoustic modelling
    Lam, YH
    Stewart, RW
    [J]. ELECTRONICS LETTERS, 1998, 34 (08) : 747 - 748
  • [4] Quantization and psychoacoustic model in audio coding in Advanced Audio Coding
    Brzuchalski, Grzegorz
    [J]. PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2011, 2011, 8008
  • [5] Audio coding using a psychoacoustic pre- and post-filter
    Edler, B
    Schuller, G
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 881 - 884
  • [6] An iterated rational filter bank for audio coding
    Blu, T
    [J]. PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1996, : 81 - 84
  • [7] Audio coding with signal adaptive block based filter bank switching
    Saleem, M.
    Ali, M. T.
    [J]. 2007 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, 2007, : 435 - +
  • [8] A fully scalable audio coding structure with embedded psychoacoustic model
    Li, Te
    Rahardja, Susanto
    Koh, Soo Ngee
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 205 - +
  • [9] New Speech Compression Technique based on Filter Bank Design and Psychoacoustic Model
    Talbi, Mourad
    Bouhlel, Med Salim
    [J]. INTERNATIONAL JOURNAL OF ACOUSTICS AND VIBRATION, 2019, 24 (04): : 728 - 735
  • [10] A matched FIR filter bank for audio coding.
    Guido, RC
    Vieira, LS
    Sanchez, FL
    Slaets, JFW
    Almeida, LO
    Gonzaga, A
    Bianchi, M
    [J]. ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 796 - 801