Multi-step Coding Structure of Spatial Audio Object Coding

Cited by: 1

Authors:

Hu, Chenhao [1,2]

Hu, Ruimin [1,2]

Wang, Xiaochen [1,2]

Wu, Tingzhao [1,2]

Li, Dengshi [1,3]

Affiliations:

[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Wuhan, Peoples R China

[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan, Peoples R China

[3] Jianghan Univ, Sch Math & Comp, Wuhan, Peoples R China

Source:

MULTIMEDIA MODELING (MMM 2020), PT I | 2020 / Vol. 11961

Funding:

National Key Research and Development Program of China;

Keywords:

Audio object coding; Residual coding; Spatial audio;

DOI:

10.1007/978-3-030-37731-1_54

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Spatial audio object coding (SAOC) is an effective method that compresses multiple audio objects and provides flexibility for personalized rendering in interactive services. It divides each frame of the signal into 28 sub-bands and extracts one set of object spatial parameters for each sub-band. In this way, the objects are coded into a downmix signal and a small number of parameters. However, using the same parameters across a whole sub-band causes frequency aliasing distortion, which seriously degrades the listening experience. Existing studies that improve SAOC cannot guarantee that all audio objects are decoded well. This paper describes a new multi-step object coding structure that efficiently calculates the residual of each object as additional side information to compensate for that object's aliasing distortion. In this multi-step structure, a sorting strategy based on the sub-band energy of each object is proposed to determine which audio object should be encoded in each step, because the object encoding order affects the final decoded quality. Singular value decomposition (SVD) is used to reduce the bit-rate increase caused by the added side information. The experimental results show that the proposed method performs better than SAOC and SAOC-TSC, and that each object can be decoded well with respect to both bit-rate and sound quality.
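The energy-based sorting step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `subband_energy` and `encoding_order`, the toy frame, and the choice to encode the highest-energy object first are all assumptions made for illustration, since the abstract only states that the order is derived from the sub-band energy of each object.

```python
from typing import Dict, List, Sequence


def subband_energy(spectrum: Sequence[complex], start: int, stop: int) -> float:
    """Sum of squared magnitudes of the frequency bins in [start, stop)."""
    return sum(abs(x) ** 2 for x in spectrum[start:stop])


def encoding_order(
    objects: Dict[str, Sequence[complex]], start: int, stop: int
) -> List[str]:
    """Return object names sorted by their energy in one sub-band.

    Assumption: highest energy first. The abstract does not state the
    direction of the sort, only that it is based on sub-band energy.
    Ties are broken by name so the result is deterministic.
    """
    energies = {
        name: subband_energy(spec, start, stop) for name, spec in objects.items()
    }
    return sorted(energies, key=lambda name: (-energies[name], name))


# Hypothetical toy frame: three objects, four frequency bins each.
frame = {
    "vocals": [1.0, 2.0, 0.5, 0.1],
    "drums": [3.0, 0.5, 0.2, 0.1],
    "bass": [0.5, 0.5, 0.1, 0.0],
}

# Order for the sub-band covering bins 0 and 1.
order = encoding_order(frame, 0, 2)
```

Because the order is computed per sub-band, the same object may be encoded at a different step in different sub-bands; in the toy frame above, the order for bins 2 and 3 differs from the order for bins 0 and 1.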

Pages: 666-678

Page count: 13
