Fine-Grained Fashion Similarity Prediction by Attribute-Specific Embedding Learning

Cited by: 15
Authors
Dong, Jianfeng [1]
Ma, Zhe [2]
Mao, Xiaofeng [3]
Yang, Xun [4]
He, Yuan [3]
Hong, Richang [5]
Ji, Shouling [2, 6]
Affiliations
[1] Zhejiang Gongshang Univ, Coll Comp & Informat Engn, Hangzhou 310035, Peoples R China
[2] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[3] Alibaba Grp, Hangzhou 311121, Peoples R China
[4] Natl Univ Singapore, Sch Comp, Singapore 119077, Singapore
[5] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei 230009, Peoples R China
[6] Zhejiang Univ, Innovat Ctr Informat Sci, Binjiang Inst, Hangzhou 310053, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Task analysis; Location awareness; Training; Extraterrestrial measurements; Deep learning; Computer science; Fashion retrieval; fine-grained similarity; fashion understanding; image retrieval
DOI
10.1109/TIP.2021.3115658
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper strives to predict fine-grained fashion similarity. In this similarity paradigm, one pays more attention to the similarity between fashion items in terms of a specific design or attribute, for example, whether the collar designs of two garments are similar. Such similarity has potential value in many fashion-related applications, such as fashion copyright protection. To this end, we propose an Attribute-Specific Embedding Network (ASEN) that jointly learns multiple attribute-specific embeddings and measures the fine-grained similarity in the corresponding embedding space. The proposed ASEN comprises a global branch and a local branch. The global branch takes the whole image as input to extract features from a global perspective, while the local branch takes as input the zoomed-in region of interest (RoI) with respect to the specified attribute and is thus able to extract more fine-grained features. Because the global branch and the local branch extract features from different perspectives, they are complementary to each other. Additionally, in each branch, two attention modules, i.e., Attribute-aware Spatial Attention and Attribute-aware Channel Attention, are integrated to enable ASEN to locate the related regions and capture the essential patterns under the guidance of the specified attribute, so that the learned attribute-specific embeddings better reflect the fine-grained similarity. Extensive experiments on three fashion-related datasets, i.e., FashionAI, DARN, and DeepFashion, show the effectiveness of ASEN for fine-grained fashion similarity prediction and its potential for fashion reranking. Code and data are available at https://github.com/maryeon/asenpp.
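The idea of attribute-guided attention described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository linked above for that): it assumes a feature map given as a plain list of per-location vectors, a hypothetical attribute vector `attr_vec` of the same channel width, dot-product scoring for the spatial attention, and a simple sigmoid gate for the channel attention. It omits all learned projections and the global/local branch split, and every function name here is illustrative only.

```python
import math


def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def l2_normalize(v):
    norm = math.sqrt(dot(v, v)) or 1.0  # avoid division by zero
    return [x / norm for x in v]


def attribute_specific_embedding(feature_map, attr_vec):
    """feature_map: list of per-location feature vectors, each of length C.
    attr_vec: hypothetical attribute embedding of length C (learned in practice)."""
    c = len(attr_vec)
    # Spatial attention (assumed form): weight each location by how well
    # its features agree with the attribute vector.
    scores = [dot(loc, attr_vec) / math.sqrt(c) for loc in feature_map]
    weights = softmax(scores)
    pooled = [
        sum(w * loc[k] for w, loc in zip(weights, feature_map))
        for k in range(c)
    ]
    # Channel attention (assumed form): one sigmoid gate per channel,
    # conditioned on the attribute vector.
    gates = [1.0 / (1.0 + math.exp(-(p * a))) for p, a in zip(pooled, attr_vec)]
    return l2_normalize([g * p for g, p in zip(gates, pooled)])


def fine_grained_similarity(map_a, map_b, attr_vec):
    # Cosine similarity between the two attribute-specific embeddings.
    emb_a = attribute_specific_embedding(map_a, attr_vec)
    emb_b = attribute_specific_embedding(map_b, attr_vec)
    return dot(emb_a, emb_b)
```

Calling `fine_grained_similarity` on the same pair of feature maps with different attribute vectors generally yields different scores, which is the sense in which the similarity is attribute-specific.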
Pages: 8410-8425
Page count: 16
Related papers
50 in total
  • [1] Fine-Grained Fashion Similarity Learning by Attribute-Specific Embedding Network
    Ma, Zhe
    Dong, Jianfeng
    Long, Zhongzi
    Zhang, Yao
    He, Yuan
    Xue, Hui
    Ji, Shouling
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11741 - 11748
  • [2] Attribute-specific Control Units in StyleGAN for Fine-grained Image Manipulation
    Wang, Rui
    Chen, Jian
    Yu, Gang
    Sun, Li
    Yu, Changqian
    Gao, Changxin
    Sang, Nong
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 926 - 934
  • [3] Learning Attribute and Class-Specific Representation Duet for Fine-grained Fashion Analysis
    Jiao, Yang
    Gao, Yan
    Meng, Jingjing
    Shang, Tin
    Sun, Yi
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11050 - 11059
  • [4] Learning Structured Relation Embeddings for Fine-Grained Fashion Attribute Recognition
    Zhu, Shumin
    Zou, Xingxing
    Qian, Jianjun
    Wong, Wai Keung
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 1652 - 1664
  • [5] Learning Fashion Similarity Based on Hierarchical Attribute Embedding
    Yan, Cairong
    Ding, Anan
    Zhang, Yanting
    Wang, Zijian
    [J]. 2021 IEEE 8TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2021,
  • [6] Fine-Grained Visual Attribute Extraction from Fashion Wear
    Parekh, Viral
    Shaik, Karimulla
    Biswas, Soma
    Chelliah, Muthusamy
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3968 - 3972
  • [7] Learning Fine-Grained Motion Embedding for Landscape Animation
    Xue, Hongwei
    Liu, Bei
    Yang, Huan
    Fu, Jianlong
    Li, Houqiang
    Luo, Jiebo
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 291 - 299
  • [8] Deformable Part Descriptors for Fine-grained Recognition and Attribute Prediction
    Zhang, Ning
    Farrell, Ryan
    Iandola, Forrest
    Darrell, Trevor
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 729 - 736
  • [9] Learning Fine-grained Image Similarity with Deep Ranking
    Wang, Jiang
    Song, Yang
    Leung, Thomas
    Rosenberg, Chuck
    Wang, Jingbin
    Philbin, James
    Chen, Bo
    Wu, Ying
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 1386 - 1393
  • [10] Fine-Grained Fashion Representation Learning by Online Deep Clustering
    Jiao, Yang
    Xie, Ning
    Gao, Yan
    Wang, Chien-Chih
    Sun, Yi
    [J]. COMPUTER VISION - ECCV 2022, PT XXVII, 2022, 13687 : 19 - 35