PDiscoNet: Semantically consistent part discovery for fine-grained recognition

Cited by: 0
Authors
van der Klis, Robert [1]
Alaniz, Stephan [2]
Mancini, Massimiliano [3]
Dantas, Cassio F. [4, 6]
Ienco, Dino [4, 6]
Akata, Zeynep [2]
Marcos, Diego [5, 6]
Affiliations
[1] WUR, Wageningen, Netherlands
[2] Univ Tubingen, Tubingen, Germany
[3] Univ Trento, Trento, Italy
[4] INRAE, UMR TETIS, Paris, France
[5] Inria, Paris, France
[6] Univ Montpellier, Montpellier, France
DOI
10.1109/ICCV51070.2023.00179
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fine-grained classification often requires recognizing specific object parts, such as beak shape and wing patterns for birds. Encouraging a fine-grained classification model to first detect such parts and then using them to infer the class could help us gauge whether the model is indeed looking at the right details better than with interpretability methods that provide a single attribution map. We propose PDiscoNet to discover object parts by using only image-level class labels along with priors encouraging the parts to be: discriminative, compact, distinct from each other, equivariant to rigid transforms, and active in at least some of the images. In addition to using the appropriate losses to encode these priors, we propose to use part-dropout, where full part feature vectors are dropped at once to prevent a single part from dominating in the classification, and part feature vector modulation, which makes the information coming from each part distinct from the perspective of the classifier. Our results on CUB, CelebA, and PartImageNet show that the proposed method provides substantially better part discovery performance than previous methods while not requiring any additional hyper-parameter tuning and without penalizing the classification performance. The code is available at https://github.com/robertdvdk/part_detection
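The part-dropout idea mentioned in the abstract (dropping a full part feature vector at once, rather than individual feature values) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `part_dropout`, the list-of-lists layout (one vector per part), the independent per-part drop probability, and the absence of any rescaling of the kept parts are all assumptions made for illustration.

```python
import random


def part_dropout(part_features, drop_prob, rng=None):
    """Zero out whole part feature vectors, each with probability drop_prob.

    part_features: list of K vectors (lists of floats), one per part.
    Returns (new_features, keep_mask). Hypothetical helper for illustration;
    the per-part independent sampling and lack of rescaling are assumptions.
    """
    if not 0.0 <= drop_prob < 1.0:
        raise ValueError("drop_prob must be in [0, 1)")
    if rng is None:
        rng = random.Random()
    # One keep/drop decision per part, not per feature dimension.
    keep_mask = [rng.random() >= drop_prob for _ in part_features]
    dropped = [
        list(vec) if keep else [0.0] * len(vec)
        for vec, keep in zip(part_features, keep_mask)
    ]
    return dropped, keep_mask


if __name__ == "__main__":
    # Three parts with two-dimensional feature vectors (made-up values).
    feats = [[0.5, 1.0], [2.0, -1.0], [0.1, 0.3]]
    out, mask = part_dropout(feats, drop_prob=0.5, rng=random.Random(0))
    print(mask)
    print(out)
```

Because a dropped part contributes an all-zero vector, a downstream classifier in this sketch cannot rely on any single part being present in every training step, which is the motivation the abstract gives for the technique.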
Pages: 1866-1876
Page count: 11
Related papers
50 records in total
  • [1] PARTICLE: Part Discovery and Contrastive Learning for Fine-grained Recognition
    Saha, Oindrila
    Maji, Subhransu
    [J]. 2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023: 167-176
  • [2] Fine-Grained Crowdsourcing for Fine-Grained Recognition
    Jia Deng
    Krause, Jonathan
    Li Fei-Fei
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013: 580-587
  • [3] Nonparametric Part Transfer for Fine-grained Recognition
    Goering, Christoph
    Rodner, Erik
    Freytag, Alexander
    Denzler, Joachim
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014: 2489-2496
  • [4] Fine-Grained Recognition without Part Annotations
    Krause, Jonathan
    Jin, Hailin
    Yang, Jianchao
    Li Fei-Fei
    [J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015: 5546-5555
  • [5] Learning Scale-Consistent Attention Part Network for Fine-Grained Image Recognition
    Liu, Huabin
    Li, Jianguo
    Li, Dian
    See, John
    Lin, Weiyao
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24: 2902-2913
  • [6] ITERATIVE OBJECT AND PART TRANSFER FOR FINE-GRAINED RECOGNITION
    Shen, Zhiqiang
    Jiang, Yu-Gang
    Wang, Dequan
    Xue, Xiangyang
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017: 1470-1475
  • [7] Audio Visual Attribute Discovery for Fine-Grained Object Recognition
    Zhang, Hua
    Cao, Xiaochun
    Wang, Rui
    [J]. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018: 7542-7549
  • [8] Deformable Part Descriptors for Fine-grained Recognition and Attribute Prediction
    Zhang, Ning
    Farrell, Ryan
    Iandola, Forrest
    Darrell, Trevor
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013: 729-736
  • [9] MINE THE FINE: FINE-GRAINED FRAGMENT DISCOVERY
    Kiapour, M. Hadi
    Di, Wei
    Jagadeesh, Vignesh
    Piramuthu, Robinson
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015: 3555-3559
  • [10] Part-Guided Relational Transformers for Fine-Grained Visual Recognition
    Zhao, Yifan
    Li, Jia
    Chen, Xiaowu
    Tian, Yonghong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30: 9470-9481