A psychoacoustic model for audio coding based on a cochlear filter bank

被引：0

作者：

Baumgarte, F ^{[1
]}

机构：

[1] Agere Syst, Media Signal Proc Res, Murray Hill, NJ 07974 USA

来源：

PROCEEDINGS OF THE 2001 IEEE WORKSHOP ON THE APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS | 2001年

关键词：

D O I：

10.1109/ASPAA.2001.969562

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is derived from a psychoacoustic model mimicking the psychoacoustics of masking. Current applications use a uniform spectral decomposition as first stage of that model to approximate the frequency selectivity of the human auditory system. The availability of efficient implementations led to a virtually exclusive use of uniform decompositions in audio coding. However, the equal filter properties of the uniform sub-bands are not in line with the nonuniform auditory filters. This paper presents a psychoacoustic model based on an efficient nonuniform cochlear filter bank with a simplified less complex post-processing for estimating the masked threshold. Application results in audio coding show a significantly better performance in terms of bit rate and/or quality of the new model in comparison with other state-of-the-art models with a uniform spectral decomposition.

引用

下载

页码：139 / 142

页数：4

共 50 条

[11] A fully scalable audio coding structure with embedded psychoacoustic model
Li, Te
Rahardja, Susanto
Koh, Soo Ngee
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 205 - +
[12] New Speech Compression Technique based on Filter Bank Design and Psychoacoustic Model
Talbi, Mourad
Bouhlel, Med Salim
INTERNATIONAL JOURNAL OF ACOUSTICS AND VIBRATION, 2019, 24 (04): : 728 - 735
[13] A matched FIR filter bank for audio coding.
Guido, RC
Vieira, LS
Sanchez, FL
Slaets, JFW
Almeida, LO
Gonzaga, A
Bianchi, M
ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 796 - 801
[14] The design of a hybrid filter bank for the psychoacoustic model in ISO/MPEG phases 1,2 audio encoder
Liu, CM
Lee, WC
IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 1997, 43 (03) : 586 - 592
[15] The design of a hybrid filter bank for the psychoacoustic model in ISO/MPEG phase 1, 2 audio encoder+
Liu, CM
Lee, WC
INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS, 1997 DIGEST OF TECHNICAL PAPERS, 1997, : 208 - 209
[16] EMD AND PSYCHOACOUSTIC MODEL BASED WATERMARKING FOR AUDIO
Wang, Liang
Emmanuel, Sabu
Kankanhalli, Mohan S.
2010 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME 2010), 2010, : 1427 - 1432
[17] An excitation level based psychoacoustic model for audio compression
Wang, Y
Vilermo, M
ACM MULTIMEDIA 99, PROCEEDINGS, 1999, : 401 - 404
[18] Imperceptible adversarial audio steganography based on psychoacoustic model
Chen, Lang
Wang, Rangding
Dong, Li
Yan, Diqun
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (17) : 26451 - 26463
[19] Imperceptible adversarial audio steganography based on psychoacoustic model
Lang Chen
Rangding Wang
Li Dong
Diqun Yan
Multimedia Tools and Applications, 2023, 82 : 26451 - 26463
[20] A Kalman filter based on wavelet filter-bank and psychoacoustic modeling for speech enhancement
Shao, Yu
Chang, Chip-Hong
2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 121 - +

← 1 2 3 4 5 →