Robustizing Object Detection Networks Using Augmented Feature Pooling

Cited by: 0
Authors
Shibata, Takashi [1 ]
Tanaka, Masayuki [2 ]
Okutomi, Masatoshi [2 ]
Affiliations
[1] NTT Corp, Tokyo, Kanagawa, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
Source
COMPUTER VISION - ACCV 2022, PT V | 2023 / Vol. 13845
DOI
10.1007/978-3-031-26348-4_6
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a framework to robustize object detection networks against large geometric transformations. Deep neural networks have rapidly and dramatically improved object detection performance. Nevertheless, modern detection algorithms are still sensitive to large geometric transformations. To improve the robustness of modern detection algorithms against such transformations, we propose a new feature extraction method called augmented feature pooling. The key is to integrate the augmented feature maps obtained from the transformed images before feeding them to the detection head, without changing the original network architecture. In this paper, we focus on rotation as a simple yet influential case of geometric transformation, although our framework is applicable to any geometric transformation. Notably, by adding only a few lines of code to the original implementation of a modern object detection algorithm and applying simple fine-tuning, we can improve its rotation robustness while inheriting the strengths of modern network architectures. Our framework overwhelmingly outperforms typical geometric data augmentation and its variants, which are used to improve robustness against appearance changes due to rotation. To evaluate rotation robustness, we construct a dataset based on MS COCO, called COCO-Rot. Extensive experiments on three datasets, including our COCO-Rot, demonstrate that our method can improve the rotation robustness of state-of-the-art algorithms.
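The idea of integrating feature maps from transformed images before the detection head can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state which rotation angles are used, how the feature maps are aligned, or which pooling operator combines them. The sketch below assumes rotations by multiples of 90 degrees, alignment by inverse rotation, and element-wise max pooling. The names `toy_backbone` and `augmented_feature_pooling` are hypothetical, and the backbone is a stand-in filter rather than a real network.

```python
import numpy as np


def toy_backbone(image):
    """Hypothetical stand-in for a CNN backbone (assumption, not from the paper).

    Computes absolute horizontal differences, so its output changes when the
    input is rotated, much like a feature extractor that is rotation-sensitive.
    """
    return np.abs(np.diff(image, axis=1, append=image[:, -1:]))


def augmented_feature_pooling(image, backbone, ks=(0, 1, 2, 3)):
    """Pool feature maps extracted from rotated copies of the image.

    Assumed procedure: rotate the image by k * 90 degrees, run the backbone,
    rotate each feature map back into the original frame, then take the
    element-wise maximum over the aligned maps.
    """
    aligned = []
    for k in ks:
        rotated = np.rot90(image, k)          # transformed input image
        features = backbone(rotated)          # feature map of the rotated image
        aligned.append(np.rot90(features, -k))  # undo the rotation on the features
    return np.max(np.stack(aligned), axis=0)  # integrate before the detection head


rng = np.random.default_rng(0)
img = rng.random((6, 6))
pooled = augmented_feature_pooling(img, toy_backbone)
```

Under these assumptions, rotating the input by 90 degrees only rotates the pooled feature map, so a detection head sees consistent features up to that rotation, whereas the toy backbone alone gives different features for the rotated input.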
Pages: 89-106
Number of pages: 18
Related papers
50 in total
  • [21] Temporal Feature Networks for CNN based Object Detection
    Weber, Michael
    Wald, Tassilo
    Zoellner, J. Marius
    2021 32ND IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2021, : 1478 - 1484
  • [22] Adaptive Region Pooling for Object Detection
    Tsai, Yi-Hsuan
    Hamsici, Onur C.
    Yang, Ming-Hsuan
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 731 - 739
  • [23] Unsupervised Feature Propagation for Fast Video Object Detection Using Generative Adversarial Networks
    Zhang, Xuan
    Han, Guangxing
    He, Wenduo
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 617 - 627
  • [24] 3D Object Detection Using Scale Invariant and Feature Reweighting Networks
    Zhao, Xin
    Liu, Zhe
    Hu, Ruolan
    Huang, Kaiqi
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 9267 - 9274
  • [25] AFAN: Augmented Feature Alignment Network for Cross-Domain Object Detection
    Wang, Hongsong
    Liao, Shengcai
    Shao, Ling
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4046 - 4056
  • [26] Object recognition using segmentation for feature detection
    Fussenegger, M
    Opelt, A
    Pinz, A
    Auer, P
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, 2004, : 41 - 44
  • [27] Object detection using feature subset selection
    Sun, ZH
    Bebis, G
    Miller, R
    PATTERN RECOGNITION, 2004, 37 (11) : 2165 - 2176
  • [28] Boosting object detection using feature selection
    Sun, ZH
    Bebis, G
    Miller, R
    IEEE CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE, PROCEEDINGS, 2003, : 290 - 296
  • [29] Information-Interaction Feature Pyramid Networks for Object Detection
    Hu, Jie
    Xie, Lihao
    Gu, Xiaoai
    Xu, Wencai
    Chang, Minjie
    Xu, Boyuan
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1301 - 1306
  • [30] Stair-Step Feature Pyramid Networks for Object Detection
    Vo, Xuan-Thuy
    Tran, Tien-Dat
    Nguyen, Duy-Linh
    Jo, Kang-Hyun
    FRONTIERS OF COMPUTER VISION, IW-FCV 2021, 2021, 1405 : 168 - 175