High-Order-Interaction for weakly supervised Fine-Grained Visual Categorization

被引:9
|
作者
Wang, Junzheng [1 ,5 ]
Li, Nanyu [3 ,4 ]
Luo, Zhiming [1 ,5 ]
Zhong, Zhun [2 ]
Li, Shaozi [1 ]
机构
[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China
[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy
[3] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Dept Comp Sci, Kunming, Yunnan, Peoples R China
[4] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China
[5] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Fine-Grained Visual Categorization; High-Order-Interaction; Trilinear pooling; ATTENTION;
D O I
10.1016/j.neucom.2021.08.108
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-Grained Visual Categorization (FGVC) is a challenging task due to the large intra-subcategory and small inter-subcategory variances. Recent studies tackle this task through a weakly supervised manner without using the part annotation from the experts. Of those, methods based on bilinear pooling are one of the main categories for computing the interaction between deep features and have shown high effectiveness. However, these methods mainly focus on the correlation within one specific layer but largely ignore the high interactions between multiple layers. In this study, we argue that considering the high interaction between the features from multiple layers can help to learn more distinguishing finegrained features. To this end, we propose a High-Order-Interaction (HOI) method for FGVC. In our HOI, an efficient cross-layer trilinear pooling is introduced to calculate the third-order interaction between three different layers. Third-order interactions of different combinations are then fused to form the final representation. HOI can produce more discriminative representations and be readily integrated with the two popular techniques, attention mechanism and triplet loss, to obtain superposed improvement. Extensive experiments conducted on four FGVC datasets show the great superiority of our method over bilinear-based methods and demonstrate that the proposed method achieves the state of the art. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:27 / 36
页数:10
相关论文
共 50 条
  • [31] Recombining Vision Transformer Architecture for Fine-Grained Visual Categorization
    Deng, Xuran
    Liu, Chuanbin
    Lu, Zhiying
    MULTIMEDIA MODELING, MMM 2023, PT II, 2023, 13834 : 127 - 138
  • [32] Cross-X Learning for Fine-Grained Visual Categorization
    Luo, Wei
    Yang, Xitong
    Mo, Xianjie
    Lu, Yuheng
    Davis, Larry S.
    Li, Jun
    Yang, Jian
    Lim, Ser-Nam
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8241 - 8250
  • [33] A survey of fine-grained visual categorization based on deep learning
    XIE Yuxiang
    GONG Quanzhi
    LUAN Xidao
    YAN Jie
    ZHANG Jiahui
    Journal of Systems Engineering and Electronics, 2024, 35 (06) : 1337 - 1356
  • [34] A Survey of Fine-Grained Visual Categorization Based on Deep Learning
    Xie, Yuxiang
    Gong, Quanzhi
    Luan, Xidao
    Yan, Jie
    Zhang, Jiahui
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2024, 35 (06) : 1337 - 1356
  • [35] Multiresolution Discriminative Mixup Network for Fine-Grained Visual Categorization
    Xu, Kunran
    Lai, Rui
    Gu, Lin
    Li, Yishi
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (07) : 3488 - 3500
  • [36] SHAPE-GUIDED SEGMENTATION FOR FINE-GRAINED VISUAL CATEGORIZATION
    Sun, Ming
    Yang, Jufeng
    Sun, Bo
    Wang, Kai
    2016 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2016,
  • [37] Refined probability distribution module for fine-grained visual categorization
    Zhao, Peipei
    Miao, Qiguang
    Li, Hongsheng
    Liu, Ruyi
    Quan, Yining
    Song, Jianfeng
    NEUROCOMPUTING, 2023, 518 : 533 - 544
  • [38] Part-Stacked CNN for Fine-Grained Visual Categorization
    Huang, Shaoli
    Xu, Zhe
    Tao, Dacheng
    Zhang, Ya
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1173 - 1182
  • [39] A Deep Sparse Coding Method for Fine-Grained Visual Categorization
    Guo, Lihua
    Guo, Chenggang
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 632 - 639
  • [40] Orientational Spatial Part Modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Xie, Fei
    Zhang, Yongdong
    Zhang, Dongming
    Su, Yu
    Tian, Qi
    2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 360 - 367