Coping with change: Learning invariant and minimum sufficient representations for fine-grained visual categorization

被引:3
|
作者
Ye, Shuo [1 ]
Yu, Shujian [2 ,3 ]
Hou, Wenjin [1 ]
Wang, Yu [1 ]
You, Xinge [1 ]
机构
[1] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan 430074, Huibei, Peoples R China
[2] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
[3] UiT The Arctic Univ Norway, Machine Learning Grp, Tromso, Norway
基金
国家重点研发计划;
关键词
Fine-grained visual categorization; Invariant risk minimization; Information bottleneck; ENTROPY;
D O I
10.1016/j.cviu.2023.103837
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fine-grained visual categorization (FGVC) is a challenging task due to similar visual appearances between various species. Previous studies always implicitly assume that the training and test data have the same underlying distributions, and that features extracted by modern backbone architectures remain discriminative and generalize well to unseen test data. However, we empirically justify that these conditions are not always true on benchmark datasets. To this end, we combine the merits of invariant risk minimization (IRM) and information bottleneck (IB) principle to learn invariant and minimum sufficient (IMS) representations for FGVC, such that the overall model can always discover the most succinct and consistent fine-grained features. We apply the matrix-based Renyi's..-order entropy to simplify and stabilize the training of IB; we also design a ''soft" environment partition scheme to make IRM applicable to FGVC task. To the best of our knowledge, we are the first to address the problem of FGVC from a generalization perspective and develop a new informationtheoretic solution accordingly. Extensive experiments demonstrate the consistent performance gain offered by our IMS. Code is available at: https://github.com/SYe- hub/IMS.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Part-Stacked CNN for Fine-Grained Visual Categorization
    Huang, Shaoli
    Xu, Zhe
    Tao, Dacheng
    Zhang, Ya
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1173 - 1182
  • [32] A Deep Sparse Coding Method for Fine-Grained Visual Categorization
    Guo, Lihua
    Guo, Chenggang
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 632 - 639
  • [33] Orientational Spatial Part Modeling for Fine-Grained Visual Categorization
    Yao, Hantao
    Zhang, Shiliang
    Xie, Fei
    Zhang, Yongdong
    Zhang, Dongming
    Su, Yu
    Tian, Qi
    2015 IEEE THIRD INTERNATIONAL CONFERENCE ON MOBILE SERVICES MS 2015, 2015, : 360 - 367
  • [34] Category attention transfer for efficient fine-grained visual categorization
    Liao, Qiyu
    Wang, Dadong
    Xu, Min
    PATTERN RECOGNITION LETTERS, 2022, 153 : 10 - 15
  • [35] Attentional Kernel Encoding Networks for Fine-Grained Visual Categorization
    Hu, Yutao
    Yang, Yandan
    Zhang, Jun
    Cao, Xianbin
    Zhen, Xiantong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (01) : 301 - 314
  • [36] Fine-Grained Visual Comparisons with Local Learning
    Yu, Aron
    Grauman, Kristen
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 192 - 199
  • [37] Fine-Grained Categorization Using a Mixture of Transfer Learning Networks
    Firsching, Justin
    Hashem, Sherif
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2021, VOL 2, 2022, 359 : 151 - 158
  • [38] ADAPTIVE MULTI-TASK LEARNING FOR FINE-GRAINED CATEGORIZATION
    Sun, Gang
    Chen, Yanyun
    Liu, Xuehui
    Wu, Enhua
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 996 - 1000
  • [39] To Know and To Learn About the Integration of Knowledge Representation and Deep Learning for Fine-Grained Visual Categorization
    Setti, Francesco
    PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2018), VOL 5: VISAPP, 2018, : 387 - 392
  • [40] Birdsnap: Large-scale Fine-grained Visual Categorization of Birds
    Berg, Thomas
    Liu, Jiongxin
    Lee, Seung Woo
    Alexander, Michelle L.
    Jacobs, David W.
    Belhumeur, Peter N.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2019 - 2026