AN IMPROVED METRIC OF INFORMATIONAL MASKING FOR PERCEPTUAL AUDIO QUALITY MEASUREMENT

被引:1
|
作者
Delgado, Pablo M. [1 ]
Herre, Juergen [1 ,2 ]
机构
[1] Int Audio Labs Erlangen, Wolfsmantel 33, D-91058 Erlangen, Germany
[2] Fraunhofer IIS, Wolfsmantel 33, D-91058 Erlangen, Germany
关键词
Psychoacoustics; Cognitive Modeling; Objective Audio Quality Assessment; PEAQ; ViSQOL; MODEL;
D O I
10.1109/WASPAA58266.2023.10248080
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Perceptual audio quality measurement systems algorithmically analyze the output of audio processing systems to estimate possible perceived quality degradation using perceptual models of human audition. In this manner, they save the time and resources associated with the design and execution of listening tests (LTs). Models of disturbance audibility predicting peripheral auditory masking in quality measurement systems have considerably increased subjective quality prediction performance of signals processed by perceptual audio codecs. Additionally, cognitive effects have also been known to regulate perceived distortion severity by influencing their salience. However, the performance gains due to cognitive effect models in quality measurement systems were inconsistent so far, particularly for music signals. Firstly, this paper presents an improved model of informational masking (IM) - an important cognitive effect in quality perception - that considers disturbance information complexity around the masking threshold. Secondly, we incorporate the proposed IM metric into a quality measurement system using a novel interaction analysis procedure between cognitive effects and distortion metrics. The procedure establishes interactions between cognitive effects and distortion metrics using LT data. The proposed IM metric is shown to outperform previously proposed IM metrics in a validation task against subjective quality scores from large and diverse LT databases. Particularly, the proposed system showed an increased quality prediction of music signals coded with bandwidth extension techniques, where other models frequently fail.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] A Perceptual Blind Blur Image Quality Metric
    Kerouh, Fatma
    Serir, Amina
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [32] VISUAL QUALITY METRIC FOR PERCEPTUAL VIDEO CODING
    Xu, Long
    Ma, Lin
    Ngan, King Ngi
    Lin, Weisi
    Weng, Ying
    2013 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP 2013), 2013,
  • [33] A perceptual quality metric for color interpolated images
    Guarneri, I
    Guarnera, M
    Bosco, A
    Santoro, G
    Image Quality and System Performance II, 2005, 5668 : 61 - 69
  • [34] Perceptual quality metric for digital video coding
    Suthaharan, S
    ELECTRONICS LETTERS, 2003, 39 (05) : 431 - 433
  • [35] Perceptual Quality Metric With Internal Generative Mechanism
    Wu, Jinjian
    Lin, Weisi
    Shi, Guangming
    Liu, Anmin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (01) : 43 - 54
  • [36] A NEW PERCEPTUAL QUALITY METRIC FOR COMPRESSED VIDEO
    Bhat, Abharana
    Richardson, Iain
    Kannangara, Sampath
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 933 - 936
  • [37] Improving perceptual coding of wideband audio signal when taking into consideration of temporal masking
    Zakharenko, A
    Kowalguin, Y
    ARCHITECTURAL ACOUSTICS AND SOUND REINFORCEMENT, 2002, : 235 - 239
  • [38] Perceptual Quality of Audio-Visual Content with Common Video and Audio Degradations
    Becerra Martinez, Helard
    Hines, Andrew
    Farias, Mylene C. Q.
    APPLIED SCIENCES-BASEL, 2021, 11 (13):
  • [39] Perceptual Coding of High-Quality Digital Audio
    Brandenburg, Karlheinz
    Faller, Christof
    Herre, Juergen
    Johnston, James D.
    Kleijn, W. Bastiaan
    PROCEEDINGS OF THE IEEE, 2013, 101 (09) : 1905 - 1919
  • [40] Objective Assessment of Perceptual Audio Quality Using ViSQOLAudio
    Sloan, Colm
    Harte, Naomi
    Kelly, Damien
    Kokaram, Anil C.
    Hines, Andrew
    IEEE TRANSACTIONS ON BROADCASTING, 2017, 63 (04) : 693 - 705