Introducing an Atypical Loss: A Perceptual Metric Learning for Image Pairing

Cited by: 1
Authors
Dahmane, Mohamed [1]
Affiliations
[1] CRIM Comp Res Inst Montreal, Montreal, PQ H3N 1M3, Canada
Source
ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2022 | 2023 / Vol. 13739
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
Atypical loss; Metric learning; Visual relationship; Image pairing; Image perception; Image retrieval;
DOI
10.1007/978-3-031-20650-4_7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent work has shown interest in comparing instances that are visually similar but semantically different. The paired Totally Looks Like (TLL) image dataset is a good example: its visually similar image pairs help reveal how humans compare images. In this research, we consider these more generic annotated categories to build a semantic manifold distance. We introduce an atypical triplet-loss that uses the inverse Kullback-Leibler divergence to model the distribution of the anchor-positive (a-p) distances. In this redefinition of the triplet-loss, the anchor-negative (a-n) loss is conditional on the a-p distance distribution, which prevents the loss-correction fluctuations that occur in the plain summed triplet-loss function of absolute distances. The proposed atypical triplet-loss builds a manifold from relative distances to a "super" anchor represented by the a-p distribution. Evaluation on the paired images of the TLL dataset shows a top-1 retrieval score (first candidate guess) of 75%, about 2.5 times the 29% recall of the baseline triplet-loss, and a top-5 pairing score as high as 78%, a gain of about 1.4 times.
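For orientation, the two reference points named in the abstract, the baseline triplet-loss and the top-k pairing score, can be sketched as follows. This is a minimal illustration and not the paper's atypical loss, whose exact formulation the abstract does not give. The Euclidean distance, the margin value of 0.2, and the function names `triplet_loss` and `top_k_recall` are assumptions made for the example.

```python
import math

def triplet_loss(anchors, positives, negatives, margin=0.2):
    """Baseline triplet loss: mean of max(d(a,p) - d(a,n) + margin, 0).

    Euclidean distance and margin=0.2 are assumptions for this sketch.
    """
    losses = [
        max(math.dist(a, p) - math.dist(a, n) + margin, 0.0)
        for a, p, n in zip(anchors, positives, negatives)
    ]
    return sum(losses) / len(losses)

def top_k_recall(left, right, k=1):
    """Fraction of queries left[i] whose true partner right[i]
    is among the k nearest candidates in `right`."""
    hits = 0
    for i, query in enumerate(left):
        # Rank all candidate indices by distance to the query embedding.
        ranked = sorted(range(len(right)),
                        key=lambda j: math.dist(query, right[j]))
        if i in ranked[:k]:
            hits += 1
    return hits / len(left)

if __name__ == "__main__":
    # Toy embeddings; real ones would come from a trained network.
    left = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
    right = [(0.9, 0.1, 0.0), (0.1, 0.9, 0.0), (0.0, 0.1, 0.9)]
    print("top-1 recall:", top_k_recall(left, right, k=1))
    print("triplet loss:", triplet_loss(left, right, right[::-1]))
```

In this reading, the top-1 score counts a pair as retrieved only when the true partner is the single nearest candidate, while the top-5 score accepts any of the five nearest.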
Pages: 81-94
Number of pages: 14