Robustizing Object Detection Networks Using Augmented Feature Pooling

被引:0
|
作者
Shibata, Takashi [1 ]
Tanaka, Masayuki [2 ]
Okutomi, Masatoshi [2 ]
机构
[1] NTT Corp, Tokyo, Kanagawa, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
来源
关键词
D O I
10.1007/978-3-031-26348-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a framework to robustize object detection networks against large geometric transformation. Deep neural networks rapidly and dramatically have improved object detection performance. Nevertheless, modern detection algorithms are still sensitive to large geometric transformation. Aiming at improving the robustness of the modern detection algorithms against the large geometric transformation, we propose a new feature extraction called augmented feature pooling. The key is to integrate the augmented feature maps obtained from the transformed images before feeding it to the detection head without changing the original network architecture. In this paper, we focus on rotation as a simple-yet-influential case of geometric transformation, while our framework is applicable to any geometric transformations. It is noteworthy that, with only adding a few lines of code from the original implementation of the modern object detection algorithms and applying simple fine-tuning, we can improve the rotation robustness of these original detection algorithms while inheriting modern network architectures' strengths. Our framework overwhelmingly outperforms typical geometric data augmentation and its variants used to improve robustness against appearance changes due to rotation. We construct a dataset based on MS COCO to evaluate the robustness of the rotation, called COCORot. Extensive experiments on three datasets, including our COCO-Rot, demonstrate that our method can improve the rotation robustness of state-of-the-art algorithms.
引用
收藏
页码:89 / 106
页数:18
相关论文
共 50 条
  • [31] Spatial Parsing and Dynamic Temporal Pooling networks for Human-Object Interaction detection
    Li, Hongsheng
    Zhu, Guangming
    Zhen, Wu
    Ni, Lan
    Shen, Peiyi
    Zhang, Liang
    Wang, Ning
    Hua, Cong
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [32] RodNet: An Advanced Multidomain Object Detection Approach Using Feature Transformation With Generative Adversarial Networks
    Jaw, Da-Wei
    Huang, Shih-Chia
    Lin, I-Chuan
    Zhang, Cheng
    Huang, Ching-Chun
    Kuo, Sy-Yen
    IEEE SENSORS JOURNAL, 2023, 23 (15) : 17531 - 17540
  • [33] Unsupervised feature selection for multi-class object detection using convolutional neural networks
    Matsugu, M
    Cardon, P
    ADVANCES IN NEURAL NETWORKS - ISNN 2004, PT 1, 2004, 3173 : 864 - 869
  • [34] RAOD: refined oriented detector with augmented feature in remote sensing images object detection
    Shi, Qin
    Zhu, Yu
    Fang, Chuantao
    Wang, Nan
    Lin, Jiajun
    APPLIED INTELLIGENCE, 2022, 52 (13) : 15278 - 15294
  • [35] RAOD: refined oriented detector with augmented feature in remote sensing images object detection
    Qin Shi
    Yu Zhu
    Chuantao Fang
    Nan Wang
    Jiajun Lin
    Applied Intelligence, 2022, 52 : 15278 - 15294
  • [36] Hierarchical Feature Pooling Transformer for Efficient UAV Object Tracking
    Wang, Haijun
    Ma, Wenlai
    Zhang, Shengyan
    Hao, Wei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [37] Object Level Deep Feature Pooling for Compact Image Representation
    Mopuri, Konda Reddy
    Babu, R. Venkatesh
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [38] Object Detection using Template and HOG Feature Matching
    Sultana, Marjia
    Ahmed, Tasniya
    Chakraborty, Partha
    Khatun, Mahmuda
    Hasan, Md Rakib
    Uddin, Mohammad Shorif
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (07) : 233 - 238
  • [39] Object Detection using Color Clue and Shape Feature
    Gode, Chetan S.
    Khobragade, Atish S.
    PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 464 - 468
  • [40] Object Detection Using a Single Extended Feature Map
    Lim, Young-Chul
    Kang, Minsung
    2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2018, : 820 - 825