Semantic granularity metric learning for visual search

Cited by: 6
Authors
Manandhar, Dipu [1, 3]
Bastan, Muhammet [2, 4]
Yap, Kim-Hui [1 ]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore
[2] Amazon, Palo Alto, CA USA
[3] Univ Surrey, Guildford, Surrey, England
[4] Nanyang Technol Univ, Singapore, Singapore
Keywords
Deep learning; Metric learning; Metric loss functions; Semantic similarity; Visual search; IMAGE SIMILARITY; DEEP; REPRESENTATION
DOI
10.1016/j.jvcir.2020.102871
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Existing metric learning methods often do not consider different granularities in visual similarity. However, in many domains, images exhibit similarity at multiple granularities with visual semantic concepts, e.g., fashion demonstrates similarity ranging from clothing of the exact same instance to similar looks/design or a common category. Therefore, training image triplets/pairs inherently possess different degrees of information. Nevertheless, the existing methods often treat them with equal importance, which hinders capturing the underlying granularities in image similarity. In view of this, we propose a new semantic granularity metric learning (SGML) that develops a novel idea of detecting and leveraging attribute semantic space and integrating it into deep metric learning to capture multiple granularities of similarity. The proposed framework simultaneously learns image attributes and embeddings with a multitask CNN, where the tasks are linked by a semantic granularity similarity mapping to leverage correlations between the tasks. To this end, we propose a new soft-binomial deviance loss that effectively integrates the informativeness of training samples into metric learning on-the-fly during training. Compared to recent ensemble-based methods, SGML is conceptually elegant and computationally simple yet effective. Extensive experiments on benchmark datasets demonstrate its superiority, e.g., a 1-4.5% Recall@1 improvement over the state of the art (Kim et al., 2018; Cakir et al., 2019) on DeepFashion-Inshop
Pages: 11
Related papers
50 in total
  • [21] Remote Sensing Image Scene Classification by Multiple Granularity Semantic Learning
    Guo, Weilong
    Li, Shengyang
    Yang, Jian
    Zhou, Zhuang
    Liu, Yunfei
    Lu, Junjie
    Kou, Longxuan
    Zhao, Manqi
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 2546 - 2562
  • [22] Regularized Contrastive Learning of Semantic Search
    Tan, Mingxi
    Rolland, Alexis
    Tian, Andong
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 119 - 130
  • [23] Learning in repeated visual search
    Michael C. Hout
    Stephen D. Goldinger
    Attention, Perception, & Psychophysics, 2010, 72 : 1267 - 1282
  • [24] Perceptual learning in visual search
    Sireteanu, R.
    Rettenbach, R.
    PERCEPTION, 1995, 24 : 20 - 21
  • [25] Locus of learning in visual search
    Walsh, V
    Ellison, A
    PERCEPTION, 1996, 25 (11) : 1374 - 1374
  • [26] Learning in repeated visual search
    Hout, Michael C.
    Goldinger, Stephen D.
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (05) : 1267 - 1282
  • [27] Learning visual models of semantic concepts
    Naphade, MR
    Smith, JR
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 2, PROCEEDINGS, 2003, : 531 - 534
  • [28] Bridging the Semantic Gap in Image Search via Visual Semantic Descriptors by Integrating Text and Visual Features
    Lekshmi, V. L.
    John, Ansamma
    COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 207 - 215
  • [29] Context dependent semantic granularity
    Albertoni, Riccardo
    Camossi, Elena
    De Martino, Monica
    Giannini, Franca
    Monti, Marina
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2011, 3 (02) : 189 - 215
  • [30] Deep Metric Learning for Open World Semantic Segmentation
    Cen, Jun
    Yun, Peng
    Cai, Junhao
    Wang, Michael Yu
    Liu, Ming
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15313 - 15322