PERCEPTUAL EVALUATION OF SPATIAL AUDIO: WHERE NEXT?

被引:0
|
作者
Francombe, Jon [1 ]
Brookes, Tim [1 ]
Mason, Russell [1 ]
机构
[1] Univ Surrey, Inst Sound Recording, Guildford GU2 7XH, Surrey, England
基金
英国工程与自然科学研究理事会;
关键词
MULTICHANNEL REPRODUCED SOUND; QUALITY; ATTRIBUTES;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
From the early days of reproduced sound, engineers have sought to reproduce the spatial properties of sound fields, leading to the development of a range of technologies. Two-channel stereo has been prevalent for many years; however, systems with a higher number of discrete channels (including rear and height loudspeakers) are becoming more common and, recently, there has been a move towards loudspeaker-agnostic methods using audio objects. Perceptual evaluation, and perceptually-informed objective measurement, of alternative reproduction systems can inform further development and steer future innovations. It is important, therefore, that any gaps in the field of perceptual evaluation and measurement are identified and that future work aims to fill those gaps. A standard research paradigm in the field is identification of the perceptual attributes of a stimulus set, facilitating controlled listening tests and leading to the development of predictive models. There have been numerous studies that aim to discover the perceptual attributes of reproduced spatial sound, leading to more than fifty descriptive terms. However, a literature review revealed the following key problems: (i) there is little agreement on exact definitions, nor on the relative importance of each attribute; (ii) there may be important attributes that have not yet been identified (e.g. attributes arising from differences between real and reproduced audio, or pertaining to new 3D or object-based methods); and (iii) there is no model of overall spatial quality based directly on the important attributes. Consequently, the authors contend that future research should focus on: (i) ascertaining which attributes of reproduced spatial audio are most important to listeners; (ii) identifying any important attributes currently missing; (iii) determining the relationships between the important attributes and listener preference; (iv) modelling overall spatial quality in terms of the important perceptual attributes; and (v) modelling these perceptual attributes in terms of their physical correlates.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Modeling perceptual similarity of audio signals for blind source separation evaluation
    Fox, Brendan
    Sabin, Andrew
    Pardo, Bryan
    Zopf, Alec
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2007, 4666 : 454 - +
  • [32] Blind Audio Source Separation by NTF and its Perceptual Quality Evaluation
    Keyder, M. Altug
    Guensel, Bilge
    2008 IEEE 16TH SIGNAL PROCESSING, COMMUNICATION AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2008, : 627 - 630
  • [33] Three-dimensional audio parametric encoding based on perceptual characteristics of spatial cue
    Zhang, Cong
    Wang, Heng
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2015, 29 (06) : 2727 - 2736
  • [34] Multiple spatial reference frames underpin perceptual recalibration to audio-visual discrepancies
    Watson, David Mark
    Akeroyd, Michael A.
    Roach, Neil W.
    Webb, Ben S.
    PLOS ONE, 2021, 16 (05):
  • [35] The perceptual basis for audio compression
    Kohlrausch, Armin
    PHYSICS TODAY, 2007, 60 (06) : 80 - 81
  • [36] Perceptual coding of digital audio
    Painter, T
    Spanias, A
    PROCEEDINGS OF THE IEEE, 2000, 88 (04) : 451 - 513
  • [37] Perceptual audio hashing functions
    Özer, H. (hozer@uekae.tubitak.gov.tr), 1780, Hindawi Publishing Corporation (2005):
  • [38] Robust perceptual audio hashing
    Özer, H
    Sankur, B
    PROCEEDINGS OF THE IEEE 12TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, 2004, : 25 - 28
  • [39] Perceptual Audio Hashing Functions
    Hamza Özer
    Bülent Sankur
    Nasir Memon
    Emin Anarım
    EURASIP Journal on Advances in Signal Processing, 2005
  • [40] Perceptual audio hashing functions
    Özer, H
    Sankur, B
    Memon, N
    Anarim, E
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2005, 2005 (12) : 1780 - 1793