PERCEPTUAL EVALUATION OF SPATIAL AUDIO: WHERE NEXT?

被引：0

作者：

Francombe, Jon ^{[1
]}

Brookes, Tim ^{[1
]}

Mason, Russell ^{[1
]}

机构：

[1] Univ Surrey, Inst Sound Recording, Guildford GU2 7XH, Surrey, England

来源：

PROCEEDINGS OF THE 22ND INTERNATIONAL CONGRESS ON SOUND AND VIBRATION: MAJOR CHALLENGES IN ACOUSTICS, NOISE AND VIBRATION RESEARCH, 2015 | 2015年

基金：

英国工程与自然科学研究理事会;

关键词：

MULTICHANNEL REPRODUCED SOUND; QUALITY; ATTRIBUTES;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

From the early days of reproduced sound, engineers have sought to reproduce the spatial properties of sound fields, leading to the development of a range of technologies. Two-channel stereo has been prevalent for many years; however, systems with a higher number of discrete channels (including rear and height loudspeakers) are becoming more common and, recently, there has been a move towards loudspeaker-agnostic methods using audio objects. Perceptual evaluation, and perceptually-informed objective measurement, of alternative reproduction systems can inform further development and steer future innovations. It is important, therefore, that any gaps in the field of perceptual evaluation and measurement are identified and that future work aims to fill those gaps. A standard research paradigm in the field is identification of the perceptual attributes of a stimulus set, facilitating controlled listening tests and leading to the development of predictive models. There have been numerous studies that aim to discover the perceptual attributes of reproduced spatial sound, leading to more than fifty descriptive terms. However, a literature review revealed the following key problems: (i) there is little agreement on exact definitions, nor on the relative importance of each attribute; (ii) there may be important attributes that have not yet been identified (e.g. attributes arising from differences between real and reproduced audio, or pertaining to new 3D or object-based methods); and (iii) there is no model of overall spatial quality based directly on the important attributes. Consequently, the authors contend that future research should focus on: (i) ascertaining which attributes of reproduced spatial audio are most important to listeners; (ii) identifying any important attributes currently missing; (iii) determining the relationships between the important attributes and listener preference; (iv) modelling overall spatial quality in terms of the important perceptual attributes; and (v) modelling these perceptual attributes in terms of their physical correlates.

引用

页数：8

共 50 条

[31] Modeling perceptual similarity of audio signals for blind source separation evaluation
Fox, Brendan
Sabin, Andrew
Pardo, Bryan
Zopf, Alec
INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 454 - +
[32] Blind Audio Source Separation by NTF and its Perceptual Quality Evaluation
Keyder, M. Altug
Guensel, Bilge
2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 627 - 630
[33] Three-dimensional audio parametric encoding based on perceptual characteristics of spatial cue
Zhang, Cong
Wang, Heng
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 29 (06) : 2727 - 2736
[34] Multiple spatial reference frames underpin perceptual recalibration to audio-visual discrepancies
Watson, David Mark
Akeroyd, Michael A.
Roach, Neil W.
Webb, Ben S.
PLOS ONE, 2021, 16 (05):
[35] The perceptual basis for audio compression
Kohlrausch, Armin
PHYSICS TODAY, 2007, 60 (06) : 80 - 81
[36] Perceptual coding of digital audio
Painter, T
Spanias, A
PROCEEDINGS OF THE IEEE, 2000, 88 (04) : 451 - 513
[37] Perceptual audio hashing functions
Özer, H. (hozer@uekae.tubitak.gov.tr), 1780, Hindawi Publishing Corporation (2005):
[38] Robust perceptual audio hashing
Özer, H
Sankur, B
PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 25 - 28
[39] Perceptual Audio Hashing Functions
Hamza Özer
Bülent Sankur
Nasir Memon
Emin Anarım
EURASIP Journal on Advances in Signal Processing, 2005
[40] Perceptual audio hashing functions
Özer, H
Sankur, B
Memon, N
Anarim, E
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (12) : 1780 - 1793

← 1 2 3 4 5 →