Robustizing Object Detection Networks Using Augmented Feature Pooling

被引:0
|
作者
Shibata, Takashi [1 ]
Tanaka, Masayuki [2 ]
Okutomi, Masatoshi [2 ]
机构
[1] NTT Corp, Tokyo, Kanagawa, Japan
[2] Tokyo Inst Technol, Tokyo, Japan
来源
COMPUTER VISION - ACCV 2022, PT V | 2023年 / 13845卷
关键词
D O I
10.1007/978-3-031-26348-4_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a framework to robustize object detection networks against large geometric transformation. Deep neural networks rapidly and dramatically have improved object detection performance. Nevertheless, modern detection algorithms are still sensitive to large geometric transformation. Aiming at improving the robustness of the modern detection algorithms against the large geometric transformation, we propose a new feature extraction called augmented feature pooling. The key is to integrate the augmented feature maps obtained from the transformed images before feeding it to the detection head without changing the original network architecture. In this paper, we focus on rotation as a simple-yet-influential case of geometric transformation, while our framework is applicable to any geometric transformations. It is noteworthy that, with only adding a few lines of code from the original implementation of the modern object detection algorithms and applying simple fine-tuning, we can improve the rotation robustness of these original detection algorithms while inheriting modern network architectures' strengths. Our framework overwhelmingly outperforms typical geometric data augmentation and its variants used to improve robustness against appearance changes due to rotation. We construct a dataset based on MS COCO to evaluate the robustness of the rotation, called COCORot. Extensive experiments on three datasets, including our COCO-Rot, demonstrate that our method can improve the rotation robustness of state-of-the-art algorithms.
引用
收藏
页码:89 / 106
页数:18
相关论文
共 50 条
  • [41] ROBUST OBJECT DETECTION SCHEME USING FEATURE SELECTION
    Pan, Hong
    Xia, LiangZheng
    Nguyen, Truong Q.
    2010 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, 2010, : 849 - 852
  • [42] Generic Object Class Detection Using Feature Maps
    Danielsson, Oscar
    Carlsson, Stefan
    IMAGE ANALYSIS: 17TH SCANDINAVIAN CONFERENCE, SCIA 2011, 2011, 6688 : 348 - 359
  • [43] Object detection using haar feature selection optimization
    Demirkir, Cem
    Sankur, Bulent
    2006 IEEE 14TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1 AND 2, 2006, : 635 - +
  • [44] TEXTURELESS OBJECT DETECTION USING CUMULATIVE ORIENTATION FEATURE
    Konishi, Yoshinori
    Ijiri, Yoshihisa
    Suwa, Masaki
    Kawade, Masato
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 1310 - 1313
  • [45] Feature Pooling - A Feature Compression Method Used in Convolutional Neural Networks
    Pei, Ge
    Gao, Hai-Chang
    Zhou, Xin
    Cheng, Nuo
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (03) : 577 - 596
  • [46] Lane Marking Detection and Classification using Spatial-Temporal Feature Pooling
    Tabelini, Lucas
    Berriel, Rodrigo
    De Souza, Alberto F.
    Badue, Claudine
    Oliveira-Santos, Thiago
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [47] Object detection and feature base learning with sparse convolutional neural networks
    Gepperth, Alexander R. T.
    ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, PROCEEDINGS, 2006, 4087 : 221 - 232
  • [48] Cooperative LIDAR Object Detection via Feature Sharing in Deep Networks
    Marvasti, Ehsan Emad
    Raftari, Arash
    Marvasti, Amir Emad
    Fallah, Yaser P.
    Guo, Rui
    Lu, Hongsheng
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [49] UEFPN: Unified and Enhanced Feature Pyramid Networks for Small Object Detection
    Qiao, Ziteng
    Shi, Dianxi
    Yi, Xiaodong
    Shi, Yanyan
    Zhang, Yuhui
    Liu, Yangyang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [50] Object recognition and tracking using Bayesian networks for augmented reality systems
    Silva, RLS
    Rodrigues, PS
    Giraldi, G
    Cunha, G
    NINTH INTERNATIONAL CONFERENCE ON INFORMATION VISUALISATION, PROCEEDINGS, 2005, : 430 - 435