Towards Interpretable Object Detection by Unfolding Latent Structures

被引：14

作者：

Wu, Tianfu ^{[1
,2
]}

Song, Xi

机构：

[1] NC State Univ, Dept ECE, Raleigh, NC 27695 USA

[2] NC State Univ, Visual Narrat Initiat, Raleigh, NC 27695 USA

来源：

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年

关键词：

D O I：

10.1109/ICCV.2019.00613

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper first proposes a method of formulating model interpretability in visual understanding tasks based on the idea of unfolding latent structures. It then presents a case study in object detection using popular two-stage regionbased convolutional network (i.e., R-CNN) detection systems [19, 50, 7, 23]. The proposed method focuses on weakly-supervised extractive rationale generation, that is learning to unfold latent discriminative part configurations of object instances automatically and simultaneously in detection without using any supervision for part configurations. It utilizes a top-down hierarchical and compositional grammar model embedded in a directed acyclic AND-OR Graph (AOG) to explore and unfold the space of latent part configurations of regions of interest (RoIs). It presents an AOGParsing operator that seamlessly integrates with the RoIPooling [19]/RoIAlign [23] operator widely used in R-CNN and is trained end-to-end. In object detection, a bounding box is interpreted by the best parse tree derived from the AOG on-the-fly, which is treated as the qualitatively extractive rationale generated for interpreting detection. In experiments, Faster R-CNN [50] is used to test the proposed method on the PASCAL VOC 2007 [13] and the COCO 2017 [40] object detection datasets. The experimental results show that the proposed method can compute promising latent structures without hurting the performance. The code and pretrained models are available at https://github.com/ iVMCL/iRCNN.

引用

页码：6032 / 6042

页数：11

共 50 条

[21] Towards lightweight military object detection
Li Z.
Nian W.
Sun X.
Li S.
Journal of Intelligent and Fuzzy Systems, 2024, 46 (04): : 10329 - 10343
[22] Towards Better Explanations for Object Detection
Van Binh Truong
Truong Thanh Hung Nguyen
Vo Thanh Khang Nguyen
Quoc Khanh Nguyen
Quoc Hung Cao
ASIAN CONFERENCE ON MACHINE LEARNING, VOL 222, 2023, 222
[23] Towards Interpretable Machine-Learning-Based DDoS Detection
Zhou Q.
Li R.
Xu L.
Nallanathan A.
Yang J.
Fu A.
SN Computer Science, 5 (1)
[24] Towards Trustworthy Rumor Detection with Interpretable Graph Structural Learning
Liu, Leyuan
Chen, Junyi
Cheng, Zhangtao
Tai, Wenxin
Zhou, Fan
PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 4089 - 4093
[25] On the Road With 16 Neurons: Towards Interpretable and Manipulable Latent Representations for Visual Predictions in Driving Scenarios
Plebe, Alice
Lio, Mauro Da
IEEE ACCESS, 2020, 8 : 179716 - 179734
[26] Interpretable CNNs for Object Classification
Zhang, Quanshi
Wang, Xin
Wu, Ying Nian
Zhou, Huilin
Zhu, Song-Chun
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (10) : 3416 - 3431
[27] Perioperative Predictions with Interpretable Latent Representation
Xue, Bing
Jiao, York
Kannampallil, Thomas
Fritz, Bradley
King, Christopher
Abraham, Joanna
Avidan, Michael
Lu, Chenyang
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 4268 - 4278
[28] Interpretable Latent Space for Meteorological Out-of-Distribution Detection via Weak Supervision
Das, Suman
Yuhas, Michael
Koh, Rachel
Easwaran, Arvind
ACM TRANSACTIONS ON CYBER-PHYSICAL SYSTEMS, 2024, 8 (02) : 1 - 26
[29] Subcategory Clustering with Latent Feature Alignment and Filtering for Object Detection
Ruan, Zhiwei
Wang, Guijin
Xue, Jing-Hao
Lin, Xinggang
IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (02) : 244 - 248
[30] 3D Object Detection with Latent Support Surfaces
Ren, Zhile
Sudderth, Erik B.
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 937 - 946

← 1 2 3 4 5 →