High-Order-Interaction for weakly supervised Fine-Grained Visual Categorization

被引:9
|
作者
Wang, Junzheng [1 ,5 ]
Li, Nanyu [3 ,4 ]
Luo, Zhiming [1 ,5 ]
Zhong, Zhun [2 ]
Li, Shaozi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[3] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Dept Comp Sci, Kunming, Yunnan, Peoples R China
[4] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[5] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Fine-Grained Visual Categorization; High-Order-Interaction; Trilinear pooling; ATTENTION;
D O I
10.1016/j.neucom.2021.08.108
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-Grained Visual Categorization (FGVC) is a challenging task due to the large intra-subcategory and small inter-subcategory variances. Recent studies tackle this task through a weakly supervised manner without using the part annotation from the experts. Of those, methods based on bilinear pooling are one of the main categories for computing the interaction between deep features and have shown high effectiveness. However, these methods mainly focus on the correlation within one specific layer but largely ignore the high interactions between multiple layers. In this study, we argue that considering the high interaction between the features from multiple layers can help to learn more distinguishing finegrained features. To this end, we propose a High-Order-Interaction (HOI) method for FGVC. In our HOI, an efficient cross-layer trilinear pooling is introduced to calculate the third-order interaction between three different layers. Third-order interactions of different combinations are then fused to form the final representation. HOI can produce more discriminative representations and be readily integrated with the two popular techniques, attention mechanism and triplet loss, to obtain superposed improvement. Extensive experiments conducted on four FGVC datasets show the great superiority of our method over bilinear-based methods and demonstrate that the proposed method achieves the state of the art. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:27 / 36
页数:10
相关论文
共 50 条
  • [1] Weakly Supervised Semantic and Attentive Data Mixing Augmentation for Fine-Grained Visual Categorization
    He, Mengqi
    Cheng, Qilong
    Qi, Guanqiu
    IEEE ACCESS, 2022, 10 : 35814 - 35823
  • [2] The Image Data and Backbone in Weakly Supervised Fine-Grained Visual Categorization: A Revisit and Further Thinking
    Ye, Shuo
    Wang, Yu
    Peng, Qinmu
    You, Xinge
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 2 - 16
  • [3] Weakly Supervised Fine-Grained Categorization With Part-Based Image Representation
    Zhang, Yu
    Wei, Xiu-Shen
    Wu, Jianxin
    Cai, Jianfei
    Lu, Jiangbo
    Nguyen, Viet-Anh
    Do, Minh N.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (04) : 1713 - 1725
  • [4] A Saliency-based Weakly-supervised Network for Fine-Grained Image Categorization
    Han, Yawen
    Meng, Fang
    2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 270 - 274
  • [5] A weakly supervised spatial group attention network for fine-grained visual recognition
    Xie, Jiangjian
    Zhong, Yujie
    Zhang, Junguo
    Zhang, Changchun
    Schuller, Bjoern W.
    APPLIED INTELLIGENCE, 2023, 53 (20) : 23301 - 23315
  • [6] A weakly supervised spatial group attention network for fine-grained visual recognition
    Jiangjian Xie
    Yujie Zhong
    Junguo Zhang
    Changchun Zhang
    Björn W Schuller
    Applied Intelligence, 2023, 53 : 23301 - 23315
  • [7] Feathers Dataset for Fine-Grained Visual Categorization
    Belko, Alina
    Dobratulin, Konstantin
    Kuznetsov, Andrey
    THIRTEENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2020), 2021, 11605
  • [8] Webly-Supervised Fine-Grained Visual Categorization via Deep Domain Adaptation
    Xu, Zhe
    Huang, Shaoli
    Zhang, Ya
    Tao, Dacheng
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (05) : 1100 - 1113
  • [9] Multi-Order Feature Statistical Model for Fine-Grained Visual Categorization
    Wang, Qingtao
    Zhang, Ke
    Fan, Jin
    Huang, Shaoli
    Zhang, Lianbo
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7379 - 7386
  • [10] Weakly Supervised Posture Mining for Fine-grained Classification
    Tang, Zhenchao
    Yang, Hualin
    Chen, Calvin Yu-Chian
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23735 - 23744