A psychoacoustic model for audio coding based on a cochlear filter bank

被引:0
|
作者
Baumgarte, F [1 ]
机构
[1] Agere Syst, Media Signal Proc Res, Murray Hill, NJ 07974 USA
关键词
D O I
10.1109/ASPAA.2001.969562
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the psychoacoustics of masking. Current applications use a uniform spectral decomposition as first stage of that model to approximate the frequency selectivity of the human auditory system. The availability of efficient implementations led to a virtually exclusive use of uniform decompositions in audio coding. However, the equal filter properties of the uniform sub-bands are not in line with the nonuniform auditory filters. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank with a simplified less complex post-processing for estimating the masked threshold. Application results in audio coding show a significantly better performance in terms of bit rate and/or quality of the new model in comparison with other state-of-the-art models with a uniform spectral decomposition.
引用
下载
收藏
页码:139 / 142
页数:4
相关论文
共 50 条
  • [1] Improved audio coding using a psychoacoustic model based on a cochlear filter bank
    Baumgarte, F
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2002, 10 (07): : 495 - 503
  • [2] A computationally efficient cochlear filter bank for perceptual audio coding
    Baumgarte, F
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 3265 - 3268
  • [3] Quantization and psychoacoustic model in audio coding in Advanced Audio Coding
    Brzuchalski, Grzegorz
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2011, 2011, 8008
  • [4] Audio coding algorithm based on wavelet packet and psychoacoustic model
    He, Dongmei
    Gao, Wen
    2000, Sci Press (37):
  • [5] Audio coding algorithm based on wavelet packet and psychoacoustic model
    He, Dongmei
    Gao, Wen
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (03): : 329 - 335
  • [6] Perceptual filter design for audio coding using psychoacoustic modelling
    Lam, YH
    Stewart, RW
    ELECTRONICS LETTERS, 1998, 34 (08) : 747 - 748
  • [7] Perceptual filter design for audio coding using psychoacoustic modelling
    Univ of Strathclyde, Glasgow, United Kingdom
    Electron Lett, 8 (747-748):
  • [8] Audio coding using a psychoacoustic pre- and post-filter
    Edler, B
    Schuller, G
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 881 - 884
  • [9] An iterated rational filter bank for audio coding
    Blu, T
    PROCEEDINGS OF THE IEEE-SP INTERNATIONAL SYMPOSIUM ON TIME-FREQUENCY AND TIME-SCALE ANALYSIS, 1996, : 81 - 84
  • [10] Audio coding with signal adaptive block based filter bank switching
    Saleem, M.
    Ali, M. T.
    2007 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, 2007, : 435 - +