Multi-step Coding Structure of Spatial Audio Object Coding

被引:1
|
作者
Hu, Chenhao [1 ,2 ]
Hu, Ruimin [1 ,2 ]
Wang, Xiaochen [1 ,2 ]
Wu, Tingzhao [1 ,2 ]
Li, Dengshi [1 ,3 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China
[3] Jianghan Univ, Sch Math & Comp, Wuhan, Peoples R China
来源
基金
国家重点研发计划;
关键词
Audio object coding; Residual coding; Spatial audio;
D O I
10.1007/978-3-030-37731-1_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spatial audio object coding (SAOC) is an effective method which compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame signal into 28 sub-bands and extracts one set object spatial parameters for each sub-band. Objects can be coded into a downmix signal and a few parameters by this way. However, using same parameters in one sub-band will cause frequency aliasing distortion, which seriously impacts listening experience. Existing studies to improve SAOC cannot guarantee that all audio objects can be decoded well. This paper describes a new multi-step object coding structure to efficient calculate residual of each object as additional side information to compensate the aliasing distortion of each object. In this multi-step structure, a sorting strategy based on sub-band energy of each object is proposed to determine which audio object should be encoded in each step, because the object encoding order will affect the final decoded quality. The singular value decomposition (SVD) is used to reduce the increasing bit-rate due to the added side information. From the experiment results, the performance of proposed method is better than SAOC and SAOC-TSC, and each object can be decoded well with respect to the bit-rate and the sound quality.
引用
收藏
页码:666 / 678
页数:13
相关论文
共 50 条
  • [1] Spatial Audio Object Coding With Two-Step Coding Structure for Interactive Audio Service
    Kim, Kwangki
    Seo, Jeongil
    Beack, Seungkwon
    Kang, Kyeongok
    Hahn, Minsoo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (06) : 1208 - 1216
  • [2] Efficient Residual Coding Method of Spatial Audio Object Coding with Two-Step Coding Structure for Interactive Audio Services
    Lee, Byonghwa
    Kim, Kwangki
    Hahn, Minsoo
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (07): : 1949 - 1952
  • [3] Mastering Signal Processing with Residual Coding Scheme in Spatial Audio Object Coding
    Kim, Kwangki
    Jong, Byeong-ok
    Park, Sanghyun
    Won, Yonggwan
    Kim, Jinsul
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND APPLICATIONS (ICISA 2013), 2013,
  • [4] Modified Spatial Audio Object Coding Scheme with Harmonic Extraction and Elimination Structure for Interactive Audio Service
    Park, Jihoon
    Kim, Kwangki
    Seo, Jeongil
    Hahn, Minsoo
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2906 - +
  • [5] MPEG Spatial Audio Object Coding-The ISO/MPEG Standard for Efficient Coding of Interactive Audio Scenes
    Herre, Juergen
    Purnhagen, Heiko
    Koppens, Jeroen
    Hellmuth, Oliver
    Engdegard, Jonas
    Hilpert, Johannes
    Villemoes, Lars
    Terentiv, Leon
    Falch, Cornelia
    Hoelzer, Andreas
    Valero, Maria Luis
    Resch, Barbara
    Mundt, Harald
    Oh, Hyen-O
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2012, 60 (09): : 655 - 673
  • [6] MPEG spatial audio object coding-The ISO/MPEG standard for efficient coding of interactive audio scenes
    Herre, J. (juergen.herre@audiolabs-erlangen.de), 1600, Audio Engineering Society (60):
  • [7] DECORRELATION FOR AUDIO OBJECT CODING
    Villemoes, Lars
    Hirvonen, Toni
    Purnhagen, Heiko
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 706 - 710
  • [8] Audio object coding based on N-step residual compensating
    Chenhao Hu
    Xiaochen Wang
    Ruimin Hu
    Yulin Wu
    Multimedia Tools and Applications, 2021, 80 : 18717 - 18733
  • [9] Audio object coding based on N-step residual compensating
    Hu, Chenhao
    Wang, Xiaochen
    Hu, Ruimin
    Wu, Yulin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (12) : 18717 - 18733
  • [10] Interactive teleconferencing combining Spatial Audio Object Coding and DirAC technology
    Herre, Jürgen
    Falch, Cornelia
    Mahne, Dirk
    Del Galdo, Giovanni
    Kallinger, Markus
    Thiergart, Oliver
    AES: Journal of the Audio Engineering Society, 2011, 59 (12): : 924 - 935