JOINT OPTIMIZATION OF THE PERCEPTUAL CORE AND LOSSLESS COMPRESSION LAYERS IN SCALABLE AUDIO CODING

Cited by: 3
Authors
Ravelli, Emmanuel [1]
Melkote, Vinay [1]
Nanjundaswamy, Tejaswi [1]
Rose, Kenneth [1]
Affiliation
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
Keywords
Audio coding; lossless coding; AAC; SLS; rate-distortion optimization
DOI
10.1109/ICASSP.2010.5495833
Chinese Library Classification
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
MPEG-4 High-Definition Advanced Audio Coding (HD-AAC) enables scalable-to-lossless (SLS) audio coding with an Advanced Audio Coding (AAC) base layer and fine-grained enhancements based on the MPEG SLS standard. While the AAC core offers better perceptual quality at lossy bit-rates, its inclusion has been observed to compromise the ultimate lossless compression performance compared with the SLS 'non-core' codec (i.e., one without an AAC base layer). In contrast, the latter provides excellent lossless compression but significantly degraded audio quality at low bit-rates. We propose a trellis-based approach that directly optimizes the trade-off between the quality of the AAC core and the lossless compression performance of SLS. Simulations testing the effectiveness of the approach demonstrate that the trade-off can be adjusted to match application-specific needs. Moreover, such optimization can in fact achieve an AAC core of superior perceptual quality while maintaining state-of-the-art (and, surprisingly, sometimes even better) lossless compression, all in compliance with the HD-AAC standard.
Pages: 365-368
Page count: 4