High-Order-Interaction for weakly supervised Fine-Grained Visual Categorization

被引:9
|
作者
Wang, Junzheng [1 ,5 ]
Li, Nanyu [3 ,4 ]
Luo, Zhiming [1 ,5 ]
Zhong, Zhun [2 ]
Li, Shaozi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[3] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Dept Comp Sci, Kunming, Yunnan, Peoples R China
[4] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[5] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Fine-Grained Visual Categorization; High-Order-Interaction; Trilinear pooling; ATTENTION;
D O I
10.1016/j.neucom.2021.08.108
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-Grained Visual Categorization (FGVC) is a challenging task due to the large intra-subcategory and small inter-subcategory variances. Recent studies tackle this task through a weakly supervised manner without using the part annotation from the experts. Of those, methods based on bilinear pooling are one of the main categories for computing the interaction between deep features and have shown high effectiveness. However, these methods mainly focus on the correlation within one specific layer but largely ignore the high interactions between multiple layers. In this study, we argue that considering the high interaction between the features from multiple layers can help to learn more distinguishing finegrained features. To this end, we propose a High-Order-Interaction (HOI) method for FGVC. In our HOI, an efficient cross-layer trilinear pooling is introduced to calculate the third-order interaction between three different layers. Third-order interactions of different combinations are then fused to form the final representation. HOI can produce more discriminative representations and be readily integrated with the two popular techniques, attention mechanism and triplet loss, to obtain superposed improvement. Extensive experiments conducted on four FGVC datasets show the great superiority of our method over bilinear-based methods and demonstrate that the proposed method achieves the state of the art. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:27 / 36
页数:10
相关论文
共 50 条
  • [21] Fine-Grained Visual Categorization of Fasteners in Overhaul Processes
    Taheritanjani, Sajjad
    Haladjian, Juan
    Bruegge, Bernd
    CONFERENCE PROCEEDINGS OF 2019 5TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS (ICCAR), 2019, : 241 - 248
  • [22] Part-Aware Fine-Grained Object Categorization Using Weakly Supervised Part Detection Network
    Zhang, Yabin
    Jia, Kui
    Wang, Zhixin
    IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (05) : 1345 - 1357
  • [23] Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition
    Sikdar, Arindam
    Liu, Yonghuai
    Kedarisetty, Siddhardha
    Zhao, Yitian
    Ahmed, Amr
    Behera, Ardhendu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2025, 133 (04) : 1755 - 1779
  • [24] Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition
    Arindam Sikdar
    Yonghuai Liu
    Siddhardha Kedarisetty
    Yitian Zhao
    Amr Ahmed
    Ardhendu Behera
    International Journal of Computer Vision, 2025, 133 (4) : 1755 - 1779
  • [25] Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition
    Sikdar, Arindam
    Liu, Yonghuai
    Kedarisetty, Siddhardha
    Zhao, Yitian
    Ahmed, Amr
    Behera, Ardhendu
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024,
  • [26] Channel Interaction Networks for Fine-Grained Image Categorization
    Gao, Yu
    Han, Xintong
    Wang, Xun
    Huang, Weilin
    Scott, Matthew R.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10818 - 10825
  • [27] Higher-order Integration of Hierarchical Convolutional Activations for Fine-grained Visual Categorization
    Cai, Sijia
    Zuo, Wangmeng
    Zhang, Lei
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 511 - 520
  • [28] Fine-Grained Background Representation for Weakly Supervised Semantic Segmentation
    Yin, Xu
    Im, Woobin
    Min, Dongbo
    Huo, Yuchi
    Pan, Fei
    Yoon, Sung-Eui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11739 - 11750
  • [29] Fine-Grained Categorization by Alignments
    Gavves, E.
    Fernando, B.
    Snoek, C. G. M.
    Smeulders, A. W. M.
    Tuytelaars, T.
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, : 1713 - 1720
  • [30] Fine-grained Visual Categorization with 2D-Warping
    Hanselmann, Harald
    Ney, Hermann
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 608 - 613