A psychoacoustic model for audio coding based on a cochlear filter bank

Author
Baumgarte, F. [1]
Affiliation
[1] Agere Syst, Media Signal Proc Res, Murray Hill, NJ 07974 USA
DOI
10.1109/ASPAA.2001.969562
Chinese Library Classification
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the psychoacoustics of masking. Current applications use a uniform spectral decomposition as first stage of that model to approximate the frequency selectivity of the human auditory system. The availability of efficient implementations led to a virtually exclusive use of uniform decompositions in audio coding. However, the equal filter properties of the uniform sub-bands are not in line with the nonuniform auditory filters. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank with a simplified less complex post-processing for estimating the masked threshold. Application results in audio coding show a significantly better performance in terms of bit rate and/or quality of the new model in comparison with other state-of-the-art models with a uniform spectral decomposition.
Pages: 139-142
Page count: 4