Towards Learning Affine-Invariant Representations via Data-Efficient CNNs

被引:0
|
作者
Xu, Wenju [1 ,4 ]
Wang, Guanghui [1 ,4 ]
Sullivan, Alan [2 ]
Zhang, Ziming [3 ]
机构
[1] Univ Kansas, Lawrence, KS 66045 USA
[2] Mitsubishi Elect Res Labs MERL, Cambridge, MA USA
[3] Worcester Polytech Inst, Worcester, MA 01609 USA
[4] MERL, Cambridge, MA 02139 USA
关键词
RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose integrating a priori knowledge into both design and training of convolutional neural networks (CNNs) to learn object representations that are invariant to affine transformations (i.e. translation, scale, rotation). Accordingly we propose a novel multi-scale maxout CNN and train it end-to-end with a novel rotation-invariant regularizer. This regularizer aims to enforce the weights in each 2D spatial filter to approximate circular patterns. In this way, we manage to handle affine transformations in training using convolution, multi-scale maxout, and circular filters. Empirically we demonstrate that such knowledge can significantly improve the data-efficiency as well as generalization and robustness of learned models. For instance, on the Traffic Sign data set and trained with only 10 images per class, our method can achieve 84.15% that outperforms the state-of-the-art by 29.80% in terms of test accuracy.
引用
收藏
页码:893 / 902
页数:10
相关论文
共 50 条
  • [21] Black-box attacks on face recognition via affine-invariant training
    Sun, Bowen
    Su, Hang
    Zheng, Shibao
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (15): : 8549 - 8564
  • [22] Robust feature matching for geospatial images via an affine-invariant coordinate system
    Li, Jiayuan
    Hu, Qingwu
    Ai, Mingyao
    PHOTOGRAMMETRIC RECORD, 2017, 32 (159): : 317 - 331
  • [23] Affine-invariant Recognition of Handwritten Characters via Accelerated KL Divergence Minimization
    Wakahara, Toru
    Yamashita, Yukihiko
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1095 - 1099
  • [24] Automatic Production of Deep Learning Benchmark Dataset for Affine-Invariant Feature Matching
    Yao, Guobiao
    Zhang, Jin
    Gong, Jianya
    Jin, Fengxiang
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2023, 12 (02)
  • [25] Black-box attacks on face recognition via affine-invariant training
    Bowen Sun
    Hang Su
    Shibao Zheng
    Neural Computing and Applications, 2024, 36 : 8549 - 8564
  • [26] Affine-invariant querying of spatial data using a triangle-based logic
    Haesevoets, Sofie
    Kuijpers, Bart
    Revesz, Peter Z.
    GEOINFORMATICA, 2020, 24 (04) : 849 - 879
  • [27] Affine-invariant querying of spatial data using a triangle-based logic
    Sofie Haesevoets
    Bart Kuijpers
    Peter Z. Revesz
    GeoInformatica, 2020, 24 : 849 - 879
  • [28] Learning Generalizable Perceptual Representations for Data-Efficient No-Reference Image Quality Assessment
    Srinath, Suhas
    Mitra, Shankhanil
    Rao, Shika
    Soundararajan, Rajiv
    2024 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION, WACV 2024, 2024, : 22 - 31
  • [29] A Survey of Data-Efficient Graph Learning
    Ju, Wei
    Yi, Siyu
    Wang, Yifan
    Long, Qingqing
    Luo, Junyu
    Xiao, Zhiping
    Zhang, Ming
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 8104 - 8113
  • [30] Uniform Priors for Data-Efficient Learning
    Sinha, Samarth
    Roth, Karsten
    Goyal, Anirudh
    Ghassemi, Marzyeh
    Akata, Zeynep
    Larochelle, Hugo
    Garg, Animesh
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 4026 - 4037