Robustizing Object Detection Networks Using Augmented Feature Pooling

被引:0
|
作者
Shibata, Takashi [1 ]
Tanaka, Masayuki [2 ]
Okutomi, Masatoshi [2 ]
机构
[1] NTT Corp, Tokyo, Kanagawa, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
来源
关键词
D O I
10.1007/978-3-031-26348-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a framework to robustize object detection networks against large geometric transformation. Deep neural networks rapidly and dramatically have improved object detection performance. Nevertheless, modern detection algorithms are still sensitive to large geometric transformation. Aiming at improving the robustness of the modern detection algorithms against the large geometric transformation, we propose a new feature extraction called augmented feature pooling. The key is to integrate the augmented feature maps obtained from the transformed images before feeding it to the detection head without changing the original network architecture. In this paper, we focus on rotation as a simple-yet-influential case of geometric transformation, while our framework is applicable to any geometric transformations. It is noteworthy that, with only adding a few lines of code from the original implementation of the modern object detection algorithms and applying simple fine-tuning, we can improve the rotation robustness of these original detection algorithms while inheriting modern network architectures' strengths. Our framework overwhelmingly outperforms typical geometric data augmentation and its variants used to improve robustness against appearance changes due to rotation. We construct a dataset based on MS COCO to evaluate the robustness of the rotation, called COCORot. Extensive experiments on three datasets, including our COCO-Rot, demonstrate that our method can improve the rotation robustness of state-of-the-art algorithms.
引用
收藏
页码:89 / 106
页数:18
相关论文
共 50 条
  • [1] Object Detection Oriented Feature Pooling for Video Semantic Indexing
    Ueki, Kazuya
    Kobayashi, Tetsunori
    PROCEEDINGS OF THE 12TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISIGRAPP 2017), VOL 5, 2017, : 44 - 51
  • [2] Precise object detection using adversarially augmented local/global feature fusion
    Han, Xiaobing
    He, Tiantian
    Ong, Yew-Soon
    Zhong, Yanfei
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 94
  • [3] Precise object detection using adversarially augmented local/global feature fusion
    Han, Xiaobing
    He, Tiantian
    Ong, Yew-Soon
    Zhong, Yanfei
    Engineering Applications of Artificial Intelligence, 2020, 94
  • [4] Small Object Detection Using Deep Feature Pyramid Networks
    Liang, Zhenwen
    Shao, Jie
    Zhang, Dongyang
    Gao, Lianli
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 554 - 564
  • [5] Feature Selective Networks for Object Detection
    Zhai, Yao
    Fu, Jingjing
    Lu, Yan
    Li, Houqiang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4139 - 4147
  • [6] Feature Pyramid Networks for Object Detection
    Lin, Tsung-Yi
    Dollar, Piotr
    Girshick, Ross
    He, Kaiming
    Hariharan, Bharath
    Belongie, Serge
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 936 - 944
  • [7] A pooling-based feature pyramid network for salient object detection
    Shi, Caijuan
    Zhang, Weiming
    Duan, Changyu
    Chen, Houru
    IMAGE AND VISION COMPUTING, 2021, 107
  • [8] CP-RCNN: Lidar Object Detection with Feature Pooling and Abstraction
    Zhou, Hang
    Ji, Yiding
    2024 18TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, ICARCV, 2024, : 821 - 826
  • [9] Object Detection in Aerial Images Using Feature Fusion Deep Networks
    Long, Hao
    Chung, Yinung
    Liu, Zhenbao
    Bu, Shuhui
    IEEE ACCESS, 2019, 7 : 30980 - 30990
  • [10] Adaptive Feature Pyramid Networks for Object Detection
    Wang, Chengyang
    Zhong, Caiming
    IEEE ACCESS, 2021, 9 : 107024 - 107032