Low delay perceptually lossless coding of audio signals

Cited by: 2
Authors
Dorward, S [1]
Huang, DW [1]
Savari, SA [1]
Schuller, G [1]
Yu, B [1]
Affiliations
[1] Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
DOI
10.1109/DCC.2001.917162
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
A novel predictive lossless coding scheme is proposed. The prediction is based on a new weighted cascaded least mean squared (WCLMS) method. To obtain both a high compression ratio and a very low encoding and decoding delay, the prediction residuals are encoded using either a variant of adaptive Huffman coding or a version of adaptive arithmetic coding. WCLMS is designed specifically for music and speech signals. It can be used either in combination with psycho-acoustically pre-filtered signals (an idea presented in [1]) to obtain perceptually lossless coding, or as a stand-alone lossless coder. Experiments on a database of moderate size and a variety of pre-filtered mono signals show that the proposed lossless coder (which needs about 2 bits/sample for pre-filtered signals) achieves better compression ratios than competing lossless coders such as ppmz, bzip2, Shorten, and LPAC. The combination of WCLMS with either of the adaptive coding schemes is also shown to achieve better compression ratios and lower delay than an earlier scheme combining WCLMS with Huffman coding over blocks of 4096 samples.
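The abstract does not spell out the WCLMS algorithm, so the following is only a rough sketch of the general idea of sample-by-sample cascaded prediction with a weighted combination of stage outputs. It is not the authors' method: the class names, the normalized LMS update, the stage orders (16, 8), the step size, the forgetting factor, and the inverse-error weighting are all assumptions chosen for illustration. The point it shows is that a backward-adaptive predictor needs no block buffering, so the decoder can mirror the encoder one sample at a time and recover the input exactly.

```python
import math


class NLMS:
    """Normalized LMS predictor over its last `order` inputs (hypothetical helper)."""

    def __init__(self, order, mu=0.5):
        self.w = [0.0] * order      # filter coefficients
        self.hist = [0.0] * order   # most recent inputs, newest first
        self.mu = mu                # assumed step size

    def predict(self):
        return sum(w * h for w, h in zip(self.w, self.hist))

    def adapt(self, target, prediction):
        err = target - prediction
        step = self.mu * err / (1e-6 + sum(h * h for h in self.hist))
        self.w = [w + step * h for w, h in zip(self.w, self.hist)]
        self.hist = [target] + self.hist[:-1]


class CascadePredictor:
    """Each stage predicts the error left by the stages before it."""

    def __init__(self, orders=(16, 8), forget=0.98):
        self.stages = [NLMS(o) for o in orders]
        self.cost = [0.0] * len(orders)  # decayed squared error per stage
        self.forget = forget             # assumed forgetting factor

    def predict(self):
        self.parts = [s.predict() for s in self.stages]
        self.cums, total = [], 0.0
        for p in self.parts:             # cumulative prediction after each stage
            total += p
            self.cums.append(total)
        # Assumed weighting: favor stages with a small recent squared error.
        weights = [1.0 / (1e-6 + c) for c in self.cost]
        mix = sum(w * c for w, c in zip(weights, self.cums)) / sum(weights)
        return round(mix)                # integer prediction keeps the residual integer

    def adapt(self, sample):
        target = float(sample)
        for k, stage in enumerate(self.stages):
            stage.adapt(target, self.parts[k])
            self.cost[k] = self.forget * self.cost[k] + (sample - self.cums[k]) ** 2
            target -= self.parts[k]      # the next stage sees this stage's error


def encode(samples):
    pred, residuals = CascadePredictor(), []
    for x in samples:
        residuals.append(x - pred.predict())
        pred.adapt(x)
    return residuals


def decode(residuals):
    pred, samples = CascadePredictor(), []
    for r in residuals:
        x = r + pred.predict()           # same prediction the encoder computed
        samples.append(x)
        pred.adapt(x)
    return samples


# Round trip on a synthetic integer sine wave.
signal = [round(1000 * math.sin(2 * math.pi * n / 50)) for n in range(1000)]
residuals = encode(signal)
assert decode(residuals) == signal
```

The residuals would then go to an adaptive entropy coder, which is omitted here. This sketch relies on the encoder and decoder executing identical floating-point operations; a deployable codec would more likely use fixed-point arithmetic to guarantee bit-exact reconstruction across platforms.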
Pages: 312 - 320
Number of pages: 9
Related papers
50 in total
  • [1] Context Lossless Coding of Audio Signals
    Ulacha, Grzegorz
    Stasinski, Ryszard
    [J]. 2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 523 - 523
  • [2] Lossless coding of audio signals using cascaded prediction
    Schuller, G
    Yu, B
    Huang, DW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 3273 - 3276
  • [3] Perceptually lossless medical image coding
    Wu, D
    Tan, DM
    Baird, M
    DeCampo, J
    White, C
    Wu, HR
    [J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2006, 25 (03) : 335 - 344
  • [4] Low delay filterbanks for enhanced low delay audio coding
    Schnell, Markus
    Geiger, Ralf
    Schmidt, Markus
    Multrus, Markus
    Mellar, Michael
    Herre, Juergen
    Schuller, Gerald
    [J]. 2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 49 - +
  • [5] Lossless coding for audio discs
    Craven, P
    Gerzon, M
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 1996, 44 (09) : 706 - 720
  • [6] Perceptually Transparent Watermarking of Audio Signals
    Atriek, Anshul
    Kaur, Arashdeep
    [J]. 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 458 - 462
  • [7] Perceptually-weighted audio coding that scales to extremely low bitrates
    Kandadai, Srivatsan
    Creusere, Charles D.
    [J]. DCC 2006: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2006, : 382 - +
  • [8] Lossless coding of audio signals using cascaded peak to valley linear prediction
    El-Sonni, ME
    El-Sonbaty, Y
    Tobail, AF
    [J]. 2005 13th IEEE International Conference on Networks Jointly held with the 2005 7th IEEE Malaysia International Conference on Communications, Proceedings 1 and 2, 2005, : 791 - 796
  • [9] Frequency warping in low delay audio coding
    Wabnik, S
    Schuller, G
    Krämer, U
    Hirschfeld, J
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 181 - 184
  • [10] A fine granular scalable perceptually lossy and lossless audio codec
    Yu, R
    Lin, X
    Rahardja, S
    Ko, CC
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 65 - 68