RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection

被引:122
|
作者
Cheng, Gong [1 ]
Zhou, Peicheng [1 ]
Han, Junwei [1 ]
机构
[1] Northwestern Polytech Univ, Sch Automat, Xian, Peoples R China
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2016.315
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Thanks to the powerful feature representations obtained through deep convolutional neural network (CNN), the performance of object detection has recently been substantially boosted. Despite the remarkable success, the problems of object rotation, within-class variability, and between-class similarity remain several major challenges. To address these problems, this paper proposes a novel and effective method to learn a rotation-invariant and Fisher discriminative CNN (RIFD-CNN) model. This is achieved by introducing and learning a rotation-invariant layer and a Fisher discriminative layer, respectively, on the basis of the existing high-capacity CNN architectures. Specifically, the rotation-invariant layer is trained by imposing an explicit regularization constraint on the objective function that enforces invariance on the CNN features before and after rotating. The Fisher discriminative layer is trained by imposing the Fisher discrimination criterion on the CNN features so that they have small within-class scatter but large between-class separation. In the experiments, we comprehensively evaluate the proposed method for object detection task on a public available aerial image dataset and the PASCAL VOC 2007 dataset. State-of-the-art results are achieved compared with the existing baseline methods.
引用
收藏
页码:2884 / 2893
页数:10
相关论文
共 50 条
  • [1] Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection
    Cheng, Gong
    Han, Junwei
    Zhou, Peicheng
    Xu, Dong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (01) : 265 - 278
  • [2] Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images
    Cheng, Gong
    Zhou, Peicheng
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12): : 7405 - 7415
  • [3] RIC-CNN: Rotation-Invariant Coordinate Convolutional Neural Network
    Mo, Hanlin
    Zhao, Guoying
    [J]. PATTERN RECOGNITION, 2024, 146
  • [4] Rotation-invariant convolutional neural networks for galaxy morphology prediction
    Dieleman, Sander
    Willett, Kyle W.
    Dambre, Joni
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2015, 450 (02) : 1441 - 1459
  • [5] Rotation-Invariant Gait Identification with Quaternion Convolutional Neural Networks
    Jing, Bowen
    Prabhu, Vinay
    Gu, Angela
    Whaley, John
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15805 - 15806
  • [6] Rotation-Invariant Wafer Map Pattern Classification With Convolutional Neural Networks
    Kang, Seokho
    [J]. IEEE ACCESS, 2020, 8 : 170650 - 170658
  • [7] Multiscale Rotation-Invariant Convolutional Neural Networks for Lung Texture Classification
    Wang, Qiangchang
    Zheng, Yuanjie
    Yang, Gongping
    Jin, Weidong
    Chen, Xinjian
    Yin, Yilong
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2018, 22 (01) : 184 - 195
  • [8] Investigation of the generalizing capabilities of convolutional neural networks in forming rotation-invariant attributes
    Malashin, R. O.
    Kadykov, A. B.
    [J]. JOURNAL OF OPTICAL TECHNOLOGY, 2015, 82 (08) : 509 - 515
  • [9] A ROTATION-INVARIANT CONVOLUTIONAL NEURAL NETWORK FOR IMAGE ENHANCEMENT FORENSICS
    Chen, Yifang
    Lyu, Zixian
    Kang, Xiangui
    Wang, Z. Jane
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2111 - 2115
  • [10] Rotation invariant face detection using convolutional neural networks
    Tivive, Fok Hing Chi
    Bouzerdoum, Abdesselam
    [J]. NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 260 - 269