Multi-step Coding Structure of Spatial Audio Object Coding

被引:1
|
作者
Hu, Chenhao [1 ,2 ]
Hu, Ruimin [1 ,2 ]
Wang, Xiaochen [1 ,2 ]
Wu, Tingzhao [1 ,2 ]
Li, Dengshi [1 ,3 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] Jianghan Univ, Sch Math & Comp, Wuhan, Peoples R China
来源
基金
国家重点研发计划;
关键词
Audio object coding; Residual coding; Spatial audio;
D O I
10.1007/978-3-030-37731-1_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spatial audio object coding (SAOC) is an effective method which compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame signal into 28 sub-bands and extracts one set object spatial parameters for each sub-band. Objects can be coded into a downmix signal and a few parameters by this way. However, using same parameters in one sub-band will cause frequency aliasing distortion, which seriously impacts listening experience. Existing studies to improve SAOC cannot guarantee that all audio objects can be decoded well. This paper describes a new multi-step object coding structure to efficient calculate residual of each object as additional side information to compensate the aliasing distortion of each object. In this multi-step structure, a sorting strategy based on sub-band energy of each object is proposed to determine which audio object should be encoded in each step, because the object encoding order will affect the final decoded quality. The singular value decomposition (SVD) is used to reduce the increasing bit-rate due to the added side information. From the experiment results, the performance of proposed method is better than SAOC and SAOC-TSC, and each object can be decoded well with respect to the bit-rate and the sound quality.
引用
收藏
页码:666 / 678
页数:13
相关论文
共 50 条
  • [21] A prototype system for object coding of musical audio
    Vincent, E
    Plumbley, MD
    2005 WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2005, : 239 - 242
  • [22] An object oriented generic audio coding architecture
    Cellario, L
    Festa, M
    Sereno, D
    Muller, JM
    Wachter, B
    1996 INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY, VOLUMES 1 AND 2 - PROCEEDINGS, 1996, : 453 - 456
  • [23] Scalable Audio Coding based on Spatial Perception in Audio Surveillance
    Liu, Hui
    Gao, Li
    2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, 2014, : 734 - 737
  • [24] A SPATIAL PRIORITY BASED SCALABLE AUDIO CODING
    Gao, Li
    Hu, Ruimin
    Yang, Yuhong
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [25] Spatial sound reproduction with directional audio coding
    Pulkki, Ville
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2007, 55 (06): : 503 - 516
  • [26] Audio Object Coding Standard Technology - MPEG SAOC
    Jung, Yang-Won
    Oh, Hyen-O
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2009, 28 (07): : 630 - 639
  • [27] Quantization and psychoacoustic model in audio coding in Advanced Audio Coding
    Brzuchalski, Grzegorz
    PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2011, 2011, 8008
  • [28] SPATIAL FILTERING USING DIRECTIONAL AUDIO CODING PARAMETERS
    Kallinger, Markus
    Del Galdo, Giovanni
    Kuech, Fabian
    Mahne, Dirk
    Schultz-Amling, Richard
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 217 - 220
  • [29] ESTIMATING SPATIAL CUES FOR AUDIO CODING IN MDCT DOMAIN
    Chen, Shuixian
    Hu, Ruimin
    Zhang, Shuhua
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 53 - +
  • [30] Distributed spatial audio coding in wireless hearing aids
    Roy, Olivier
    Vetterli, Martin
    2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 2007, : 53 - 56