A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

被引:31
|
作者
Manocha, Pranay [1 ]
Finkelstein, Adam [1 ]
Zhang, Richard [2 ]
Bryan, Nicholas J. [2 ]
Mysore, Gautham J. [2 ]
Jin, Zeyu [2 ]
机构
[1] Princeton Univ, Princeton, NJ 08544 USA
[2] Adobe Res, San Jose, CA USA
来源
关键词
QUALITY ASSESSMENT;
D O I
10.21437/Interspeech.2020-1191
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Many audio processing tasks require perceptual assessment. The "gold standard" of obtaining human judgments is time-consuming, expensive, and cannot be used as an optimization criterion. On the other hand, automated metrics are efficient to compute but often correlate poorly with human judgment, particularly for audio differences at the threshold of human detection. In this work, we construct a metric by fitting a deep neural network to a new large dataset of crowdsourced human judgments. Subjects are prompted to answer a straightforward, objective question: are two recordings identical or not? These pairs are algorithmically generated under a variety of perturbations, including noise, reverb, and compression artifacts; the perturbation space is probed with the goal of efficiently identifying the just-noticeable difference (JND) level of the subject. We show that the resulting learned metric is well-calibrated with human judgments, outperforming baseline methods. Since it is a deep network, the metric is differentiable, making it suitable as a loss function for other tasks. Thus, simply replacing an existing loss (e.g., deep feature loss) with our metric yields significant improvement in a denoising network, as measured by subjective pairwise comparison.
引用
收藏
页码:2852 / 2856
页数:5
相关论文
共 50 条
  • [21] AVS Encoding Optimization with Perceptual Just Noticeable Distortion Model
    Cai, Qi
    Song, Li
    2013 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2013,
  • [22] JUST NOTICEABLE DIFFERENCES FOR SEGMENT DURATION IN NATURAL SPEECH
    HUGGINS, AWF
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1972, 51 (04): : 1270 - &
  • [23] JUST-NOTICEABLE DIFFERENCES IN EDGE AND LINE BLUR
    HAMERLY, JR
    DVORAK, CA
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA, 1980, 70 (12) : 1585 - 1585
  • [24] JUST NOTICEABLE DISTORTION MAP PREDICTION FOR PERCEPTUAL MULTIVIEW VIDEO CODING
    Gao, Yu
    Xiu, Xiaoyu
    Liang, Jie
    Lin, Weisi
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 1045 - 1048
  • [25] Perceptual watermarking using a new Just-Noticeable-Difference model
    Phi Bang Nguyen
    Beghdadi, Azeddine
    Luong, Marie
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (10) : 1506 - 1525
  • [26] Just Noticeable Distortion-Based Perceptual Rate Control in HEVC
    Zhou, Mingliang
    Wei, Xuekai
    Kwong, Sam
    Jia, Weijia
    Fang, Bin
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 (29) : 7603 - 7614
  • [27] Just noticeable differences for glottal flow waveform characteristics
    Scherer, RC
    Arehart, KH
    Guo, CG
    Milstein, CF
    Horii, Y
    JOURNAL OF VOICE, 1998, 12 (01) : 21 - 30
  • [28] An Exploration of Just Noticeable Differences in Mid -Air Haptics
    Wojna, Katarzyna
    Georgiou, Orestis
    Beattie, David
    Frier, William
    Wright, Michael
    Lutteroth, Christof
    2023 IEEE WORLD HAPTICS CONFERENCE, WHC, 2023, : 410 - 416
  • [29] Using just noticeable differences to interpret test scores
    Stricker, LJ
    PSYCHOLOGICAL METHODS, 2000, 5 (04) : 415 - 424
  • [30] JUST NOTICEABLE DIFFERENCES IN SOME VEHICLE HANDLING VARIABLES
    HOFFMANN, ER
    JOUBERT, PN
    HUMAN FACTORS, 1968, 10 (03) : 263 - &