A System Architecture for Semantically Informed Rendering of Object-Based Audio

被引：2

作者：

Franck, Andreas ^{[1
]}

Francombe, Jon ^{[2
]}

Woodcock, James ^{[3
]}

Hughes, Richard ^{[3
]}

Coleman, Philip ^{[4
]}

Menzies, Dylan ^{[1
]}

Cox, Trevor J. ^{[3
]}

Jackson, Philip J. B. ^{[5
]}

Fazi, Filippo Maria ^{[1
]}

机构：

[1] Univ Southampton, Inst Sound & Vibrat Res, Southampton SO17 1BJ, Hants, England

[2] MediaCityUK, BBC Res & Dev, Dock House, Salford M50 2LH, Lancs, England

[3] Univ Salford, Acoust Res Ctr, Salford M5 4WT, Lancs, England

[4] Univ Surrey, Inst Sound Recording, Guildford GU2 7XH, Surrey, England

[5] Univ Surrey, Ctr Vis Speech & Signal Proc, Guildford GU2 7XH, Surrey, England

来源：

JOURNAL OF THE AUDIO ENGINEERING SOCIETY | 2019年 / 67卷 / 7-8期

基金：

英国工程与自然科学研究理事会;

关键词：

Adaptation strategies - Audio rendering - Object trajectories - Perceptual attributes - Personalizations - Software frameworks - Spatial audio - System architectures;

D O I：

10.17743/jaes.2019.0025

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Object-based audio promises format-agnostic reproduction and extensive personalization of spatial audio content. However, in practical listening scenarios, such as in consumer audio. ideal reproduction is typically not possible. To maximize the quality of listening experience, a different approach is required, for example modifications of metadata to adjust for the reproduction layout or personalization choices. In this paper we propose a novel system architecture for semantically informed rendering (SIR), that combines object audio rendering with high-level processing of object metadata. In many cases, this processing uses novel, advanced metadata describing the objects to optimally adjust the audio scene to the reproduction system or listener preferences. The proposed system is evaluated with several adaptation strategies, including semantically motivated downmix to layouts with few loudspeakers, manipulation of perceptual attributes, perceptual reverberation compensation, and orchestration of mobile devices for immersive reproduction. These examples demonstrate how SIR can significantly improve the media experience and provide advanced personalization controls, for example by maintaining smooth object trajectories on systems with few loudspeakers, or providing personalized envelopment levels. An example implementation of the proposed system architecture is described and provided as an open, extensible software framework that combines object-based audio rendering and high-level processing of advanced object metadata.

引用

页码：498 / 509

页数：12

共 50 条

[41] An architecture for building reliable distributed object-based systems
Wang, L
Zhou, WL
TOOLS 24: TECHNOLOGY OF OBJECT-ORIENTED LANGUAGES, PROCEEDINGS, 1998, 24 : 260 - 265
[42] Object-based HyperVideo authoring system
Chang, HB
Hsu, HH
Liao, YC
Shih, TK
Tang, CT
2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXP (ICME), VOLS 1-3, 2004, : 2219 - 2222
[43] Object-based simulation modelling system
Beijing Univ of Aeronautics and, Astronautics, Beijing, China
Beijing Hangkong Hangtian Daxue Xuebao, 5 (607-611):
[44] Algorithms for multiplex scheduling of object-based audio-visual presentations
Kalva, H
Eleftheriadis, A
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2004, 14 (12) : 1283 - 1293
[45] Determination and Validation of Mix Parameters for Modifying Envelopment in Object-Based Audio
Francombe, Jon
Brookes, Tim
Mason, Russell
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2018, 66 (03): : 127 - 145
[46] Perceptual Evaluation of Blind Source Separation in Object-Based Audio Production
Coleman, Philip
Liu, Qingju
Francombe, Jon
Jackson, Philip J. B.
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 558 - 567
[47] Object-based audio streaming over error-prone channels
Marks, SK
Gonzalez, R
2005 IEEE International Conference on Multimedia and Expo (ICME), Vols 1 and 2, 2005, : 261 - 264
[48] Object-Based Benefits Without Object-Based Representations
Fougnie, Daryl
Cormiea, Sarah M.
Alvarez, George A.
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2013, 142 (03) : 621 - 626
[49] PERCEPTUAL LOUDNESS COMPENSATION IN INTERACTIVE OBJECT-BASED AUDIO CODING SYSTEMS
Paulus, Jouni
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 579 - 583
[50] Bit rate required for mono audio object in object-based audio program compressed with MPEG-H 3D Audio
Sugimoto, Takehiro
ACOUSTICAL SCIENCE AND TECHNOLOGY, 2023, 44 (02) : 93 - 100

← 1 2 3 4 5 →