High-Order-Interaction for weakly supervised Fine-Grained Visual Categorization

Cited by: 9

Authors:

Wang, Junzheng [1,5]

Li, Nanyu [3,4]

Luo, Zhiming [1,5]

Zhong, Zhun [2]

Li, Shaozi [1]

Affiliations:

[1] Xiamen Univ, Sch Informat, Dept Artificial Intelligence, Xiamen, Peoples R China

[2] Univ Trento, Dept Informat Engn & Comp Sci, Trento, Italy

[3] Kunming Univ Sci & Technol, Sch Informat Engn & Automat, Dept Comp Sci, Kunming, Yunnan, Peoples R China

[4] Peking Univ, Wangxuan Inst Comp Technol, Beijing, Peoples R China

[5] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou, Peoples R China

Source:

NEUROCOMPUTING | 2021 / Vol. 464

Funding:

China Postdoctoral Science Foundation; National Natural Science Foundation of China;

Keywords:

Fine-Grained Visual Categorization; High-Order-Interaction; Trilinear pooling; Attention

DOI:

10.1016/j.neucom.2021.08.108

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Fine-Grained Visual Categorization (FGVC) is a challenging task because of large intra-subcategory variance and small inter-subcategory variance. Recent studies tackle this task in a weakly supervised manner, without using part annotations from experts. Among these, methods based on bilinear pooling form one of the main categories for computing interactions between deep features and have proven highly effective. However, these methods mainly focus on the correlation within one specific layer and largely ignore the high-order interactions between multiple layers. In this study, we argue that considering the high-order interaction between features from multiple layers helps to learn more discriminative fine-grained features. To this end, we propose a High-Order-Interaction (HOI) method for FGVC. In HOI, an efficient cross-layer trilinear pooling is introduced to calculate the third-order interaction between three different layers. Third-order interactions of different layer combinations are then fused to form the final representation. HOI produces more discriminative representations and can be readily integrated with two popular techniques, the attention mechanism and the triplet loss, for further cumulative improvement. Extensive experiments on four FGVC datasets show that our method clearly outperforms bilinear-based methods and achieves state-of-the-art performance. (C) 2021 Elsevier B.V. All rights reserved.
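The abstract describes cross-layer trilinear pooling only in words, so the following is a rough illustrative sketch rather than the authors' formulation. It assumes that the feature maps from the chosen layers have already been projected to a common shape (C channels, N spatial locations), that the third-order interaction is taken as an element-wise product summed over locations, and that fusion is plain concatenation. The function names `trilinear_pool` and `fuse_interactions`, the signed-square-root normalisation, and the toy sizes are all hypothetical choices made for this example.

```python
from itertools import combinations

import numpy as np


def trilinear_pool(a, b, c):
    """Third-order interaction of three (C, N) feature maps (assumed form).

    Multiplies the three maps element-wise at every spatial location and
    sums over the N locations, giving one C-dimensional descriptor.
    """
    if not (a.shape == b.shape == c.shape):
        raise ValueError("all three feature maps must share the shape (C, N)")
    pooled = np.einsum("cn,cn,cn->c", a, b, c)
    # Signed square root and L2 normalisation, a common post-processing
    # step for bilinear-style descriptors (an assumption here).
    pooled = np.sign(pooled) * np.sqrt(np.abs(pooled))
    norm = np.linalg.norm(pooled)
    return pooled / norm if norm > 0 else pooled


def fuse_interactions(layers):
    """Concatenate the trilinear descriptors of every 3-layer combination."""
    descriptors = [trilinear_pool(*trio) for trio in combinations(layers, 3)]
    return np.concatenate(descriptors)


# Toy input: 4 layers, each with C=8 channels and N=7*7 locations.
rng = np.random.default_rng(0)
layers = [rng.standard_normal((8, 49)) for _ in range(4)]
representation = fuse_interactions(layers)
print(representation.shape)  # (32,)
```

With four layers there are four distinct three-layer combinations, so the fused vector has 4 x 8 = 32 entries. In a real network this vector would feed a classifier, and the paper's attention mechanism and triplet loss would sit around it; neither is modelled in this sketch.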

Pages: 27-36

Number of pages: 10
