Semantic granularity metric learning for visual search

被引：6

作者：

Manandhar, Dipu ^{[1
,3
]}

Bastan, Muhammet ^{[2
,4
]}

Yap, Kim-Hui ^{[1
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore, Singapore

[2] Amazon, Palo Alto, CA USA

[3] Univ Surrey, Guildford, Surrey, England

[4] Nanyang Technol Univ, Singapore, Singapore

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2020年 / 72卷 / 72期

关键词：

Deep learnin; Metric learning; Metric loss functions; Semantic similarity; Visual search; IMAGE SIMILARITY; DEEP; REPRESENTATION;

D O I：

10.1016/j.jvcir.2020.102871

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing metric learning methods often do not consider different granularly in visual similarly. However, in many domains, images exhibit similarly at multiple granularities with visual semantic concepts, e.g. fashion demonstrates similarly ranging from clothing of the exact same instance to similar looks/design or common category. Therefore, training image triplets/pairs inherently possess different degree of information. Nevertheless, the existing methods often treat them with equal importance which hinder capturing underlying granularities in image similarly. In view of this, we propose a new semantic granularly metric learning (SGML) that develops a novel idea of detecting and leveraging attribute semantic space and integrating it into deep metric learning to capture multiple granularities of similarly. The proposed framework simultaneously learns image attributes and embeddings with multitask-CNN where the tasks are linked by semantic granularly similarly mapping to leverage correlations between the tasks. To this end, we propose a new soft-binomial deviance loss that effectively integrates informativeness of training samples into metric-learning on-the-fly during training. Compared to recent ensemble-based methods, SGML is conceptually elegant, computationally simple yet effective. Extensive experiments on benchmark datasets demonstrate its superiorly e.g., 1-4.5%-Recall@1 improvement over the state-of-the-arts (Kim a al., 2018; Cakir a al., 2019) on DeepFashion-Inshop

引用

页数：11

共 50 条

[21] Remote Sensing Image Scene Classification by Multiple Granularity Semantic Learning
Guo, Weilong
Li, Shengyang
Yang, Jian
Zhou, Zhuang
Liu, Yunfei
Lu, Junjie
Kou, Longxuan
Zhao, Manqi
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2022, 15 : 2546 - 2562
[22] Regularized Contrastive Learning of Semantic Search
Tan, Mingxi
Rolland, Alexis
Tian, Andong
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 119 - 130
[23] Learning in repeated visual search
Michael C. Hout
Stephen D. Goldinger
Attention, Perception, & Psychophysics, 2010, 72 : 1267 - 1282
[24] Perceptual learning in visual search
Sireteanu, R.
Rettenbach, R.
PERCEPTION, 1995, 24 : 20 - 21
[25] Locus of learning in visual search
Walsh, V
Ellison, A
PERCEPTION, 1996, 25 (11) : 1374 - 1374
[26] Learning in repeated visual search
Hout, Michael C.
Goldinger, Stephen D.
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (05) : 1267 - 1282
[27] Learning visual models of semantic concepts
Naphade, MR
Smith, JR
2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 2, PROCEEDINGS, 2003, : 531 - 534
[28] Bridging the Semantic Gap in Image Search via Visual Semantic Descriptors by Integrating Text and Visual Features
Lekshmi, V. L.
John, Ansamma
COMPUTATIONAL INTELLIGENCE, CYBER SECURITY AND COMPUTATIONAL MODELS, ICC3 2015, 2016, 412 : 207 - 215
[29] Context dependent semantic granularity
Albertoni, Riccardo
Camossi, Elena
De Martino, Monica
Giannini, Franca
Monti, Marina
INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2011, 3 (02) : 189 - 215
[30] Deep Metric Learning for Open World Semantic Segmentation
Cen, Jun
Yun, Peng
Cai, Junhao
Wang, Michael Yu
Liu, Ming
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15313 - 15322

← 1 2 3 4 5 →