JOINT OPTIMIZATION OF THE PERCEPTUAL CORE AND LOSSLESS COMPRESSION LAYERS IN SCALABLE AUDIO CODING

被引：3

作者：

Ravelli, Emmanuel ^{[1
]}

Melkote, Vinay ^{[1
]}

Nanjundaswamy, Tejaswi ^{[1
]}

Rose, Kenneth ^{[1
]}

机构：

[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Audio coding; lossless coding; AAC; SLS; ratedistortion optimization;

D O I：

10.1109/ICASSP.2010.5495833

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

MPEG-4 High-Definition Advanced Audio Coding (HD-AAC) enables scalable-to-lossless (SLS) audio coding with an Advanced Audio Coding (AAC) base layer, and fine-grained enhancements based on the MPEG SLS standard. While the AAC core offers better perceptual quality at lossy bit-rates, its inclusion has been observed to compromise the ultimate lossless compression performance as compared to the SLS 'non-core' (i.e., without an AAC base layer) codec. In contrast, the latter provides excellent lossless compression but with significantly degraded audio quality at low bit-rates. We propose a trellis-based approach to directly optimize the trade-off between the quality of the AAC core and the lossless compression performance of SLS. Simulations to test the effectiveness of the approach demonstrate the capability to adjust the trade-off to match application specific needs. Moreover, such optimization can in fact achieve an AAC core of superior perceptual quality while maintaining state-of-the-art (and surprisingly sometimes even better) lossless compression, all this in compliance with the HD-AAC standard.

引用

下载

页码：365 / 368

页数：4

共 50 条

[1] Joint Optimization of Base and Enhancement Layers in Scalable Audio Coding
Ravelli, Emmanuel
Melkote, Vinay
Nanjundaswamy, Tejaswi
Rose, Kenneth
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (04): : 711 - 724
[2] Joint speech/audio coding based scalable perceptual audio coding
Gao, Li
Hu, Ruimin
Yang, Yuhong
2014 IEEE/ACIS 13TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2014, : 419 - 424
[3] Fine grain scalable perceptual and lossless audio coding based on IntMDCT
Geiger, R
Herre, J
Schuller, G
Sporer, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 445 - 448
[4] Fine grain scalable perceptual and lossless audio coding based on IntMDCT
Geiger, R
Schuller, G
Sporer, T
Herre, J
2003 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS PROCEEDINGS, 2003, : 50 - 50
[5] Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT)
Raad, M
Mertins, A
Burnett, I
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO AND ELECTROACOUSTICS MULTIMEDIA SIGNAL PROCESSING, 2003, : 624 - 627
[6] Scalable to lossless audio compression based on perceptual set partitioning in hierarchical trees (PSPIHT)
Raad, M
Mertins, A
Burnett, I
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 393 - 396
[7] Sampling rate scalable lossless audio coding
Moriya, T
Jin, A
Mori, T
Ikeda, K
Kaneko, T
2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS: A PARADIGM SHIFT TOWARD NEW CODING FUNCTIONS FOR THE BROADBAND AGE, 2002, : 123 - 125
[8] Lossless scalable audio coding and quality enhancement
Moriya, T
Jin, A
Mori, T
Ikeda, K
Kaneko, T
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (03): : 425 - 429
[9] ENHANCED SCALABLE TO LOSSLESS AUDIO CODING SCHEME
Shu, Haiyan
Huang, Haibin
Li, Te
Rahardja, Susanto
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 377 - 380
[10] A design of lossy and lossless scalable audio coding
Moriya, T
Iwakami, N
Jin, A
Mori, T
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 889 - 892

← 1 2 3 4 5 →