Robustizing Object Detection Networks Using Augmented Feature Pooling

被引：0

作者：

Shibata, Takashi ^{[1
]}

Tanaka, Masayuki ^{[2
]}

Okutomi, Masatoshi ^{[2
]}

机构：

[1] NTT Corp, Tokyo, Kanagawa, Japan

[2] Tokyo Inst Technol, Tokyo, Japan

来源：

COMPUTER VISION - ACCV 2022, PT V | 2023年 / 13845卷

关键词：

D O I：

10.1007/978-3-031-26348-4_6

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a framework to robustize object detection networks against large geometric transformation. Deep neural networks rapidly and dramatically have improved object detection performance. Nevertheless, modern detection algorithms are still sensitive to large geometric transformation. Aiming at improving the robustness of the modern detection algorithms against the large geometric transformation, we propose a new feature extraction called augmented feature pooling. The key is to integrate the augmented feature maps obtained from the transformed images before feeding it to the detection head without changing the original network architecture. In this paper, we focus on rotation as a simple-yet-influential case of geometric transformation, while our framework is applicable to any geometric transformations. It is noteworthy that, with only adding a few lines of code from the original implementation of the modern object detection algorithms and applying simple fine-tuning, we can improve the rotation robustness of these original detection algorithms while inheriting modern network architectures' strengths. Our framework overwhelmingly outperforms typical geometric data augmentation and its variants used to improve robustness against appearance changes due to rotation. We construct a dataset based on MS COCO to evaluate the robustness of the rotation, called COCORot. Extensive experiments on three datasets, including our COCO-Rot, demonstrate that our method can improve the rotation robustness of state-of-the-art algorithms.

引用

页码：89 / 106

页数：18

共 50 条

[21] Temporal Feature Networks for CNN based Object Detection
Weber, Michael
Wald, Tassilo
Zoellner, J. Marius
2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 1478 - 1484
[22] Adaptive Region Pooling for Object Detection
Tsai, Yi-Hsuan
Hamsici, Onur C.
Yang, Ming-Hsuan
2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 731 - 739
[23] Unsupervised Feature Propagation for Fast Video Object Detection Using Generative Adversarial Networks
Zhang, Xuan
Han, Guangxing
He, Wenduo
MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 617 - 627
[24] 3D Object Detection Using Scale Invariant and Feature Reweighting Networks
Zhao, Xin
Liu, Zhe
Hu, Ruolan
Huang, Kaiqi
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9267 - 9274
[25] AFAN: Augmented Feature Alignment Network for Cross-Domain Object Detection
Wang, Hongsong
Liao, Shengcai
Shao, Ling
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4046 - 4056
[26] Object recognition using segmentation for feature detection
Fussenegger, M
Opelt, A
Pinz, A
Auer, P
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 41 - 44
[27] Object detection using feature subset selection
Sun, ZH
Bebis, G
Miller, R
PATTERN RECOGNITION, 2004, 37 (11) : 2165 - 2176
[28] Boosting object detection using feature selection
Sun, ZH
Bebis, G
Miller, R
IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2003, : 290 - 296
[29] Information-Interaction Feature Pyramid Networks for Object Detection
Hu, Jie
Xie, Lihao
Gu, Xiaoai
Xu, Wencai
Chang, Minjie
Xu, Boyuan
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1301 - 1306
[30] Stair-Step Feature Pyramid Networks for Object Detection
Vo, Xuan-Thuy
Tran, Tien-Dat
Nguyen, Duy-Linh
Jo, Kang-Hyun
FRONTIERS OF COMPUTER VISION, IW-FCV 2021, 2021, 1405 : 168 - 175

← 1 2 3 4 5 →