Introducing an Atypical Loss: A Perceptual Metric Learning for Image Pairing

被引:1
|
作者
Dahmane, Mohamed [1 ]
机构
[1] CRIM Comp Res Inst Montreal, Montreal, PQ H3N 1M3, Canada
来源
ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2022 | 2023年 / 13739卷
基金
加拿大自然科学与工程研究理事会;
关键词
Atypical loss; Metric learning; Visual relationship; Image pairing; Image perception; Image retrieval;
D O I
10.1007/978-3-031-20650-4_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent works have shown an interest in comparing visually similar but semantically different instances. The paired Totally Looks Like (TLL) image dataset is a good example of visually similar paired images to figure out how humans compare images. In this research, we consider these more generic annotated categories to build a semantic manifold distance. We introduce an atypical triplet-loss using the inverse Kullback-Leibler divergence to model the distribution of the anchor-positive (a-p) distances. In the new redefinition of triplet-loss, the anchor-negative (a-n) loss is conditional to the a-p distance distribution which prevents the loss correction fluctuations in the plain summed triplet-loss function of absolute distances. The proposed atypical triplet-loss builds a manifold from relative distances to a "super" anchor represented by the a-p distribution. The evaluation on the paired images of the TLL dataset showed that the retrieving score from the first candidate guess (top-1) is 75% which is x 2.5 higher compared to the recall score of the baseline triplet-loss which is limited to 29%, and with a top-5 pairing score as high as 78% which represents a gain of x1.4.
引用
收藏
页码:81 / 94
页数:14
相关论文
共 50 条
  • [21] Metric Learning for Image Registration
    Niethammer, Marc
    Kwitt, Roland
    Vialard, Francois-Xavier
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 8455 - 8464
  • [22] Metric Learning For Image Steganalysis
    Chen, Guoming
    Chen, Qiang
    Zhang, Dong
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 898 - 902
  • [23] Perceptual Dissimilarity Metric: A Full Reference Objective Image Quality Measure to Quantify the Degradation of Perceptual Image Quality
    Saha, Sajib
    Tahtali, Murat
    Lambert, Andrew
    Pickering, Mark
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 327 - 332
  • [24] Improving the performance of DEMUCS in speech enhancement with the perceptual metric loss
    Wu, Zong-Tai
    Chen, Yon-Tong
    Hung, Jeih-weih
    2022 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS - TAIWAN, IEEE ICCE-TW 2022, 2022, : 267 - 268
  • [25] Modeling Perceptual Color Differences by Local Metric Learning
    Perrot, Michael
    Habrard, Amaury
    Muselet, Damien
    Sebban, Marc
    COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 96 - 111
  • [26] A perceptual quality metric for image fusion based on regional information
    Chen, H
    Varshney, PK
    Multisensor, Multisource Information Fusion: Architectures, Algorithms and Applications 2005, 2005, 5813 : 34 - 45
  • [27] Wavelet based image compression using perceptual distortion metric
    Goswami, H
    Kozaitis, SP
    VISUAL INFORMATION PROCESSING XII, 2003, 5108 : 93 - 97
  • [28] Image Colorization Using GANs and Perceptual Loss
    Sankar, Rahul
    Nair, Ashwin
    Abhinav, Prince
    Mothukuri, Siva Krishna P.
    Koolagudi, Shashidhar G.
    2020 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SIGNAL PROCESSING (AISP), 2020,
  • [29] Spread Spectrum Image Watermarking Based on Perceptual Quality Metric
    Zhang, Fan
    Liu, Wenyu
    Lin, Weisi
    Ngan, King Ngi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (11) : 3207 - 3218
  • [30] On the development of a reduced-reference perceptual image quality metric
    Kusuma, TM
    Zepernick, HJ
    Caldera, M
    2005 SYSTEMS COMMUNICATIONS, PROCEEDINGS: ICW 2005, WIRELESS TECHNOLOGIES; ICHSN 2005, HIGH SPEED NETWORKS; ICMCS 2005, MULTIMEDIA COMMUNICATIONS SYSTEMS; SENET 2005, SENSOR NETWORKS, 2005, : 178 - 184