Deformable Convolutional Networks

Cited by: 3715
Authors
Dai, Jifeng [1]
Qi, Haozhi [1]
Xiong, Yuwen [1]
Li, Yi [1]
Zhang, Guodong [1]
Hu, Han [1]
Wei, Yichen [1]
Affiliations
[1] Microsoft Research Asia, Beijing, People's Republic of China
DOI
10.1109/ICCV.2017.89
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in their building modules. In this work, we introduce two new modules to enhance the transformation modeling capability of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the modules with additional offsets and learning the offsets from the target tasks, without additional supervision. The new modules can readily replace their plain counterparts in existing CNNs and can be easily trained end-to-end by standard back-propagation, giving rise to deformable convolutional networks. Extensive experiments validate the performance of our approach. For the first time, we show that learning dense spatial transformation in deep CNNs is effective for sophisticated vision tasks such as object detection and semantic segmentation. The code is released at https://github.com/msracver/Deformable-ConvNets.
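The core idea in the abstract, adding an offset to each sampling location of a convolution, can be illustrated with a minimal pure-Python sketch. It is not the authors' released implementation. It assumes a single-channel feature map, a 3x3 kernel, and bilinear interpolation for reading fractional positions, and it takes the offsets as plain inputs, whereas in a trained network they would be predicted from the features and learned by back-propagation. All function and variable names are hypothetical.

```python
import math

def bilinear_sample(feature, y, x):
    """Read a 2D list at fractional (y, x); positions outside the map count as 0."""
    height, width = len(feature), len(feature[0])
    y0, x0 = math.floor(y), math.floor(x)
    value = 0.0
    for yy in (y0, y0 + 1):
        for xx in (x0, x0 + 1):
            if 0 <= yy < height and 0 <= xx < width:
                # Bilinear weight: product of 1D triangle kernels.
                weight = (1 - abs(y - yy)) * (1 - abs(x - xx))
                value += weight * feature[yy][xx]
    return value

# Regular 3x3 sampling grid, relative to the output location.
GRID = [(dy, dx) for dy in (-1, 0, 1) for dx in (-1, 0, 1)]

def deformable_conv_at(feature, kernel, offsets, cy, cx):
    """One output value of a single-channel 3x3 deformable convolution.

    kernel:  9 weights, one per grid position.
    offsets: 9 (dy, dx) pairs added to the regular grid positions
             (assumed given here; learned in a real network).
    """
    total = 0.0
    for w, (gy, gx), (oy, ox) in zip(kernel, GRID, offsets):
        total += w * bilinear_sample(feature, cy + gy + oy, cx + gx + ox)
    return total

# Toy 5x5 feature map with value 5*row + col, and an averaging kernel.
feature = [[float(r * 5 + c) for c in range(5)] for r in range(5)]
kernel = [1.0 / 9] * 9

zero_offsets = [(0.0, 0.0)] * 9      # reduces to a plain convolution
shifted_offsets = [(0.0, 0.5)] * 9   # every sample moved half a pixel right

plain = deformable_conv_at(feature, kernel, zero_offsets, 2, 2)
shifted = deformable_conv_at(feature, kernel, shifted_offsets, 2, 2)
print(plain, shifted)  # about 12.0 and 12.5
```

With all offsets at zero the function reduces to an ordinary 3x3 convolution. Giving each of the nine positions its own offset lets the sampling pattern deform freely, which is the capability the abstract describes.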
Pages: 764-773
Number of pages: 10