Spatial Audio Object Coding With Two-Step Coding Structure for Interactive Audio Service

Cited by: 14
Authors
Kim, Kwangki [1]
Seo, Jeongil [2]
Beack, Seungkwon [2]
Kang, Kyeongok [2]
Hahn, Minsoo [3]
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Informat & Commun Engn, Taejon 305701, South Korea
[2] Elect & Telecommun Res Inst, Taejon 305606, South Korea
[3] Korea Adv Inst Sci & Technol, Dept Elect Engn, Taejon 305701, South Korea
Keywords
Audio object; interactive audio service; residual coding; spatial audio object coding
DOI
10.1109/TMM.2011.2168197
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
An interactive audio service is a new type of audio service that lets users experience alternative and advanced forms of audio. In such a service, users can freely control individual audio objects to create their own sound. Spatial audio object coding (SAOC) is a useful technology that supports most of an interactive audio service at a relatively low bit rate, but it performs poorly when perfect gain control of a particular audio object (the target audio object) is required. In this paper, SAOC with a two-step coding structure is proposed to handle the target audio object efficiently alongside the normal audio objects. A transform coded excitation (TCX) based residual coding scheme is presented to enhance sound quality. Experimental results show that the proposed two-step SAOC handles the various audio objects successfully in terms of both bit rate and sound quality.
Pages: 1208-1216
Number of pages: 9