A psychoacoustic model for audio coding based on a cochlear filter bank

Cited by: 0
Authors
Baumgarte, F [1]
Affiliations
[1] Agere Syst, Media Signal Proc Res, Murray Hill, NJ 07974 USA
DOI
10.1109/ASPAA.2001.969562
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Perceptual audio coders use an estimated masked threshold to determine the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model that mimics the psychoacoustics of masking. Current applications use a uniform spectral decomposition as the first stage of that model to approximate the frequency selectivity of the human auditory system. The availability of efficient implementations has led to a virtually exclusive use of uniform decompositions in audio coding. However, the equal filter properties of the uniform sub-bands do not match the nonuniform auditory filters. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank with simplified, less complex post-processing for estimating the masked threshold. Application results in audio coding show that the new model performs significantly better in terms of bit rate and/or quality than other state-of-the-art models that use a uniform spectral decomposition.
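The mismatch between equal-width sub-bands and nonuniform auditory filters can be illustrated with a rough sketch. It is not the paper's cochlear filter bank: it assumes the Glasberg and Moore equivalent rectangular bandwidth (ERB) formula as a stand-in for auditory filter width, and the function names, the 48 kHz sample rate, and the 32-band split are hypothetical choices made only for illustration.

```python
# Illustrative sketch only: compares the fixed width of a uniform sub-band
# with an ERB-based estimate of auditory filter width. The ERB formula is an
# assumed stand-in and is not taken from the paper.

def erb_hz(center_hz: float) -> float:
    """Equivalent rectangular bandwidth (Glasberg and Moore) in Hz."""
    return 24.7 * (4.37 * center_hz / 1000.0 + 1.0)


def uniform_band_hz(sample_rate_hz: float, num_bands: int) -> float:
    """Width in Hz of each band when 0..Nyquist is split into equal bands."""
    return (sample_rate_hz / 2.0) / num_bands


def bandwidth_ratio(center_hz: float,
                    sample_rate_hz: float = 48000.0,
                    num_bands: int = 32) -> float:
    """Uniform band width divided by the ERB at center_hz.

    A ratio above 1 means the uniform band is wider than the assumed
    auditory filter; a ratio below 1 means it is narrower.
    """
    return uniform_band_hz(sample_rate_hz, num_bands) / erb_hz(center_hz)


if __name__ == "__main__":
    for f in (100.0, 1000.0, 10000.0):
        print(f"{f:8.0f} Hz  ERB = {erb_hz(f):8.1f} Hz  "
              f"uniform/ERB = {bandwidth_ratio(f):6.2f}")
```

Under these assumptions, a uniform band is many times wider than the auditory filter at low frequencies and narrower at high frequencies, which is the kind of mismatch the abstract attributes to uniform decompositions.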
Pages: 139-142
Page count: 4
Related papers
50 in total
  • [21] Oversampled filter bank evaluation for joint subband audio processing and coding
    Hermann, David
    Chau, Edward
    Dony, Robert D.
    Areibi, Shawki M.
    2008 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-4, 2008, : 526 - +
  • [22] FPGA prototyping of an in-situ reconfigurable filter bank for audio coding
    Naviner, LAD
    Naviner, JF
    de Barros, MA
    Proceedings of the 46th IEEE International Midwest Symposium on Circuits & Systems, Vols 1-3, 2003, : 847 - 850
  • [23] PEAQ-based psychoacoustic model for perceptual audio coder
    Hu, XP
    He, GM
    Hou, XP
    8th International Conference on Advanced Communication Technology, Vols 1-3: TOWARD THE ERA OF UBIQUITOUS NETWORKS AND SOCIETIES, 2006, : U1819 - U1823
  • [24] Audio steganalysis based on reversed psychoacoustic model of human hearing
    Ghasemzadeh, Hamzeh
    Khass, Mehdi Tajik
    Arjmandi, Meisam Khalil
    DIGITAL SIGNAL PROCESSING, 2016, 51 : 133 - 141
  • [25] Digital watermarks for audio signal based on psychoacoustic masking model
    Nakayama, A
    Lu, JL
    Nakamura, S
    Shikano, K
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART III-FUNDAMENTAL ELECTRONIC SCIENCE, 2003, 86 (12): : 65 - 75
  • [26] Audio watermarking based on psychoacoustic model and adaptive wavelet packets
    Quan, XM
    Zhang, HB
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 2518 - 2521
  • [27] Architecture design of MDCT-based Psychoacoustic Model co-processor in MPEG advanced audio coding
    Tsai, TH
    Huang, SW
    Wang, YW
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 2, PROCEEDINGS, 2004, : 761 - 764
  • [28] Chaos based audio watermarking with MPEG psychoacoustic model I
    Giovanardi, A
    Mazzini, G
    Tomassetti, M
    ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003, : 1609 - 1613
  • [29] Audio watermarking algorithm based on wavelet packet and psychoacoustic model
    Wang, RD
    Xu, DW
    Li, Q
    PDCAT 2005: Sixth International Conference on Parallel and Distributed Computing, Applications and Technologies, Proceedings, 2005, : 812 - 814
  • [30] Assessment of Audio Signal Noise Reduction Based on Psychoacoustic Model
    Balik, Miroslav
    Raso, Ondrej
    2018 25TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2018,