Towards Interpretable Object Detection by Unfolding Latent Structures

被引:14
|
作者
Wu, Tianfu [1 ,2 ]
Song, Xi
机构
[1] NC State Univ, Dept ECE, Raleigh, NC 27695 USA
[2] NC State Univ, Visual Narrat Initiat, Raleigh, NC 27695 USA
关键词
D O I
10.1109/ICCV.2019.00613
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper first proposes a method of formulating model interpretability in visual understanding tasks based on the idea of unfolding latent structures. It then presents a case study in object detection using popular two-stage regionbased convolutional network (i.e., R-CNN) detection systems [19, 50, 7, 23]. The proposed method focuses on weakly-supervised extractive rationale generation, that is learning to unfold latent discriminative part configurations of object instances automatically and simultaneously in detection without using any supervision for part configurations. It utilizes a top-down hierarchical and compositional grammar model embedded in a directed acyclic AND-OR Graph (AOG) to explore and unfold the space of latent part configurations of regions of interest (RoIs). It presents an AOGParsing operator that seamlessly integrates with the RoIPooling [19]/RoIAlign [23] operator widely used in R-CNN and is trained end-to-end. In object detection, a bounding box is interpreted by the best parse tree derived from the AOG on-the-fly, which is treated as the qualitatively extractive rationale generated for interpreting detection. In experiments, Faster R-CNN [50] is used to test the proposed method on the PASCAL VOC 2007 [13] and the COCO 2017 [40] object detection datasets. The experimental results show that the proposed method can compute promising latent structures without hurting the performance. The code and pretrained models are available at https://github.com/ iVMCL/iRCNN.
引用
收藏
页码:6032 / 6042
页数:11
相关论文
共 50 条
  • [21] Towards lightweight military object detection
    Li Z.
    Nian W.
    Sun X.
    Li S.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 10329 - 10343
  • [22] Towards Better Explanations for Object Detection
    Van Binh Truong
    Truong Thanh Hung Nguyen
    Vo Thanh Khang Nguyen
    Quoc Khanh Nguyen
    Quoc Hung Cao
    ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
  • [23] Towards Interpretable Machine-Learning-Based DDoS Detection
    Zhou Q.
    Li R.
    Xu L.
    Nallanathan A.
    Yang J.
    Fu A.
    SN Computer Science, 5 (1)
  • [24] Towards Trustworthy Rumor Detection with Interpretable Graph Structural Learning
    Liu, Leyuan
    Chen, Junyi
    Cheng, Zhangtao
    Tai, Wenxin
    Zhou, Fan
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4089 - 4093
  • [25] On the Road With 16 Neurons: Towards Interpretable and Manipulable Latent Representations for Visual Predictions in Driving Scenarios
    Plebe, Alice
    Lio, Mauro Da
    IEEE ACCESS, 2020, 8 : 179716 - 179734
  • [26] Interpretable CNNs for Object Classification
    Zhang, Quanshi
    Wang, Xin
    Wu, Ying Nian
    Zhou, Huilin
    Zhu, Song-Chun
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3416 - 3431
  • [27] Perioperative Predictions with Interpretable Latent Representation
    Xue, Bing
    Jiao, York
    Kannampallil, Thomas
    Fritz, Bradley
    King, Christopher
    Abraham, Joanna
    Avidan, Michael
    Lu, Chenyang
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4268 - 4278
  • [28] Interpretable Latent Space for Meteorological Out-of-Distribution Detection via Weak Supervision
    Das, Suman
    Yuhas, Michael
    Koh, Rachel
    Easwaran, Arvind
    ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2024, 8 (02) : 1 - 26
  • [29] Subcategory Clustering with Latent Feature Alignment and Filtering for Object Detection
    Ruan, Zhiwei
    Wang, Guijin
    Xue, Jing-Hao
    Lin, Xinggang
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (02) : 244 - 248
  • [30] 3D Object Detection with Latent Support Surfaces
    Ren, Zhile
    Sudderth, Erik B.
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 937 - 946