A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

被引:31
|
作者
Manocha, Pranay [1 ]
Finkelstein, Adam [1 ]
Zhang, Richard [2 ]
Bryan, Nicholas J. [2 ]
Mysore, Gautham J. [2 ]
Jin, Zeyu [2 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Adobe Res, San Jose, CA USA
来源
关键词
QUALITY ASSESSMENT;
D O I
10.21437/Interspeech.2020-1191
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many audio processing tasks require perceptual assessment. The "gold standard" of obtaining human judgments is time-consuming, expensive, and cannot be used as an optimization criterion. On the other hand, automated metrics are efficient to compute but often correlate poorly with human judgment, particularly for audio differences at the threshold of human detection. In this work, we construct a metric by fitting a deep neural network to a new large dataset of crowdsourced human judgments. Subjects are prompted to answer a straightforward, objective question: are two recordings identical or not? These pairs are algorithmically generated under a variety of perturbations, including noise, reverb, and compression artifacts; the perturbation space is probed with the goal of efficiently identifying the just-noticeable difference (JND) level of the subject. We show that the resulting learned metric is well-calibrated with human judgments, outperforming baseline methods. Since it is a deep network, the metric is differentiable, making it suitable as a loss function for other tasks. Thus, simply replacing an existing loss (e.g., deep feature loss) with our metric yields significant improvement in a denoising network, as measured by subjective pairwise comparison.
引用
收藏
页码:2852 / 2856
页数:5
相关论文
共 50 条
  • [41] JUST NOTICEABLE DIFFERENCES FOR CHANGES OF INTERAURAL TIME DIFFERENCES AS A FUNCTION OF INTERAURAL TIME DIFFERENCES
    CAMPBELL, RA
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1959, 31 (01): : 123 - 123
  • [42] An HEVC-compliant perceptual video coding using just noticeable difference
    Guo, Jiefeng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (01) : 1257 - 1286
  • [43] Just-Noticeable-Quantization-Distortion Based Preprocessing for Perceptual Video Coding
    Ki, Sehwan
    Kim, Munchurl
    Ko, Hyunsuk
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [44] Estimation of just-noticeable differences in multispectral color space
    Mironova, I
    Toivanen, P
    CGIV 2004: SECOND EUROPEAN CONFERENCE ON COLOR IN GRAPHICS, IMAGING, AND VISION - CONFERENCE PROCEEDINGS, 2004, : 376 - 378
  • [45] Just-Noticeable Difference-Based Perceptual Optimization for JPEG Compression
    Zhang, Xinfeng
    Wang, Shiqi
    Gu, Ke
    Lin, Weisi
    Ma, Siwei
    Gao, Wen
    IEEE SIGNAL PROCESSING LETTERS, 2017, 24 (01) : 96 - 100
  • [46] An HEVC-compliant perceptual video coding using just noticeable difference
    Jiefeng Guo
    Multimedia Tools and Applications, 2022, 81 : 1257 - 1286
  • [47] Perceptual Image Compression with Block-Level Just Noticeable Difference Prediction
    Tian, Tao
    Wang, Hanli
    Kwong, Sam
    Kuo, C-C Jay
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 16 (04)
  • [48] Perceptual Video Coding based on Visual Saliency Modulated Just Noticeable Distortion
    Cui, Jing
    Xiong, Ruiqin
    Zhang, Xinfeng
    Wang, Shanshe
    Ma, Siwei
    2019 DATA COMPRESSION CONFERENCE (DCC), 2019, : 565 - 565
  • [49] JUST-NOTICEABLE DIFFERENCES FOR SOLID-AREA NOISE
    HAMERLY, JR
    JOURNAL OF APPLIED PHOTOGRAPHIC ENGINEERING, 1983, 9 (01): : 14 - 17
  • [50] Just noticeable differences in sound quality metrics for refrigerator noise
    You, Jin
    Jeon, Jin Yong
    NOISE CONTROL ENGINEERING JOURNAL, 2008, 56 (06) : 414 - 424