Structured audio, Kolmogorov complexity, and generalized audio coding

Cited by: 5
Authors
Scheirer, ED [1]
Affiliations
[1] MIT, Media Lab, Machine Listening Grp, Cambridge, MA 02139 USA
Source
Keywords
audio compression; MPEG-4; sound synthesis; structured audio
DOI
10.1109/89.966095
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Structured-audio techniques are a recent development in audio coding that develop new connections between the existing practices of audio synthesis and audio compression. A theoretical basis for this coding model is presented, grounded in information theory and Kolmogorov complexity theory. It is demonstrated that algorithmic structured audio can provide higher compression ratios than other techniques for many audio signals and proved rigorously that it can provide compression at least as good as every other technique (up to a constant term) for every audio signal. The MPEG-4 Structured Audio standard is the first practical application of algorithmic coding theory. It points the direction toward a new paradigm of generalized audio coding, in which structured-audio coding subsumes all other audio-coding techniques. Generalized audio coding offers new marketplace models that enable advances in compression technology to be rapidly leveraged toward the solution of problems in audio coding.
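The abstract's claim of compression "at least as good as every other technique (up to a constant term)" has the same shape as the invariance theorem of Kolmogorov complexity. As a sketch only, in standard textbook notation that is assumed here and not taken from the paper: U is a universal description method, M is any other computable description method, and K_U(x) and K_M(x) are the lengths of the shortest descriptions of a signal x under each.

```latex
K_U(x) \le K_M(x) + c_M \quad \text{for all } x
```

The constant c_M depends on M (roughly, the cost of describing M's decoder to U) but not on the signal x.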
Pages: 914-931
Page count: 18
Related papers
50 in total
  • [21] SUBJECTIVE EVALUATION OF 4 LOW-COMPLEXITY AUDIO CODING SCHEMES
    JOSEPH, SM
    MAHER, RC
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (06) : 3657 - 3662
  • [22] Complexity scalable audio coding algorithm based on wavelet packet decomposition
    He, DM
    Gao, W
    Wu, JQ
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 659 - 665
  • [23] The Kolmogorov complexity, universal distribution, and coding theorem for generalized length functions
    Kobayashi, K
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1997, 43 (03) : 816 - 826
  • [24] Scalable Audio Coding based on Spatial Perception in Audio Surveillance
    Liu, Hui
    Gao, Li
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 734 - 737
  • [25] Audio object coding for distributed audio data management applications
    Melih, K
    Gonzalez, R
    ICCS 2002: 8TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2002, : 727 - 731
  • [26] Adaptive Audio Steganography Based on Advanced Audio Coding and Syndrome-Trellis Coding
    Luo, Weiqi
    Zhang, Yue
    Li, Haodong
    DIGITAL FORENSICS AND WATERMARKING, 2017, 10431 : 177 - 186
  • [27] Source segmentation for structured audio
    Melih, K
    Gonzalez, R
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 811 - 814
  • [28] Audio coding for conversion to MIDI
    Sieger, NJ
    Tewfik, AH
    1997 IEEE FIRST WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1997, : 101 - 106
  • [29] DECORRELATION FOR AUDIO OBJECT CODING
    Villemoes, Lars
    Hirvonen, Toni
    Purnhagen, Heiko
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 706 - 710
  • [30] ATSC video and audio coding
    Davidson, GA
    Isnardi, MA
    Fielder, LD
    Goldman, MS
    Todd, CC
    PROCEEDINGS OF THE IEEE, 2006, 94 (01) : 60 - 76