Deformable Convolutional Networks

被引：3715

作者：

Dai, Jifeng ^{[1
]}

Qi, Haozhi ^{[1
]}

Xiong, Yuwen ^{[1
]}

Li, Yi ^{[1
]}

Zhang, Guodong ^{[1
]}

Hu, Han ^{[1
]}

Wei, Yichen ^{[1
]}

机构：

[1] Microsoft Res Asia, Beijing, Peoples R China

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年

关键词：

D O I：

10.1109/ICCV.2017.89

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in their building modules. In this work, we introduce two new modules to enhance the transformation modeling capability of CNNs, namely, deformable convolution and deformable RoI pooling. Both are based on the idea of augmenting the spatial sampling locations in the modules with additional offsets and learning the offsets from the target tasks, without additional supervision. The new modules can readily replace their plain counterparts in existing CNNs and can be easily trained end-to-end by standard back-propagation, giving rise to deformable convolutional networks. Extensive experiments validate the performance of our approach. For the first time, we show that learning dense spatial transformation in deep CNNs is effective for sophisticated vision tasks such as object detection and semantic segmentation. The code is released at https://github.com/msracver/Deformable-ConvNets.

引用

页码：764 / 773

页数：10

共 50 条

[1] Deformable Graph Convolutional Networks
Park, Jinyoung
Yoo, Sungdong
Park, Jihwan
Kim, Hyunwoo J.
[J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 7949 - 7956
[2] Deformable Part Models are Convolutional Neural Networks
Girshick, Ross
Iandola, Forrest
Darrell, Trevor
Malik, Jitendra
[J]. 2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2015, : 437 - 446
[3] Learning Conditional Deformable Templates with Convolutional Networks
Dalca, Adrian V.
Rakic, Marianne
Guttag, John
Sabuncu, Mert R.
[J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[4] Deformable Convolutional Neural Networks for Hyperspectral Image Classification
Zhu, Jian
Fang, Leyuan
Ghamisi, Pedram
[J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2018, 15 (08) : 1254 - 1258
[5] Deformable image registration using convolutional neural networks
Eppenhof, Koen A. J.
Lafarge, Maxime W.
Moeskops, Pim
Veta, Mitko
Pluim, Josien P. W.
[J]. MEDICAL IMAGING 2018: IMAGE PROCESSING, 2018, 10574
[6] Monaural Speech Dereverberation Using Deformable Convolutional Networks
Kothapally, Vinay
Hansen, John H. L.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 1712 - 1723
[7] EDVR: Video Restoration with Enhanced Deformable Convolutional Networks
Wang, Xintao
Chan, Kelvin C. K.
Yu, Ke
Dong, Chao
Loy, Chen Change
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2019), 2019, : 1954 - 1963
[8] AN EFFICIENT ACCELERATOR DESIGN METHODOLOGY FOR DEFORMABLE CONVOLUTIONAL NETWORKS
Ahn, Saehyun
Chang, Jung-Woo
Kang, Suk-Ju
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 3075 - 3079
[9] Progressively Trained Convolutional Neural Networks for Deformable Image Registration
Eppenhof, Koen A. J.
Lafarge, Maxime W.
Veta, Mitko
Pluim, Josien P. W.
[J]. IEEE TRANSACTIONS ON MEDICAL IMAGING, 2020, 39 (05) : 1594 - 1604
[10] A Memory-Efficient Hardware Architecture for Deformable Convolutional Networks
Yu, Yue
Luo, Jiapeng
Mao, Wendong
Wang, Zhongfeng
[J]. 2021 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS 2021), 2021, : 140 - 145

← 1 2 3 4 5 →