Salient Object Detection via Dynamic Scale Routing

被引：21

作者：

Wu, Zhenyu ^{[1
]}

Li, Shuai ^{[1
,2
]}

Chen, Chenglizhao ^{[1
]}

Qin, Hong ^{[3
]}

Hao, Aimin ^{[1
,2
,4
]}

机构：

[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China

[3] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA

[4] Chinese Acad Med Sci, Res Unit Virtual Human & Virtual Surg 2019RU004, Beijing 100006, Peoples R China

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2022年 / 31卷

基金：

中国国家自然科学基金; 美国国家科学基金会;

关键词：

Kernel; Decoding; Feature extraction; Routing; Deep learning; Object detection; Technological innovation; Dynamic scale routing; scale-aware feature aggregation; salient object detection; NETWORK; FUSION;

D O I：

10.1109/TIP.2022.3214332

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent research advances in salient object detection (SOD) could largely be attributed to ever-stronger multi-scale feature representation empowered by the deep learning technologies. The existing SOD deep models extract multi-scale features via the off-the-shelf encoders and combine them smartly via various delicate decoders. However, the kernel sizes in this commonly-used thread are usually "fixed". In our new experiments, we have observed that kernels of small size are preferable in scenarios containing tiny salient objects. In contrast, large kernel sizes could perform better for images with large salient objects. Inspired by this observation, we advocate the "dynamic" scale routing (as a brand-new idea) in this paper. It will result in a generic plug-in that could directly fit the existing feature backbone. This paper's key technical innovations are two-fold. First, instead of using the vanilla convolution with fixed kernel sizes for the encoder design, we propose the dynamic pyramid convolution (DPConv), which dynamically selects the best-suited kernel sizes w.r.t. the given input. Second, we provide a self-adaptive bidirectional decoder design to accommodate the DPConv-based encoder best. The most significant highlight is its capability of routing between feature scales and their dynamic collection, making the inference process scale-aware. As a result, this paper continues to enhance the current SOTA performance. Both the code and dataset are publicly available at https://github.com/wuzhenyubuaa/DPNet.

引用

页码：6649 / 6663

页数：15

共 50 条

[41] Multi-scale Interactive Network for Salient Object Detection
Pang, Youwei
Zhao, Xiaoqi
Zhang, Lihe
Lu, Huchuan
[J]. arXiv, 2020,
[42] Salient Object Detection with CNNs and Multi-scale CRFs
Xu, Yingyue
Hong, Xiaopeng
Zhao, Guoying
[J]. IMAGE ANALYSIS, 2019, 11482 : 233 - 245
[43] Salient object detection based on multi-scale contrast
Wang, Hai
Dai, Lei
Cai, Yingfeng
Sun, Xiaoqiang
Chen, Long
[J]. NEURAL NETWORKS, 2018, 101 : 47 - 56
[44] Salient object detection on large-scale video data
Zhang, Shile
Fan, Jianping
Lu, Hong
Xue, Xiangyang
[J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3704 - +
[45] SBN: Scale Balance Network for Accurate Salient Object Detection
Tan, Zhenshan
Gu, Xiaodong
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[46] Multi-Scale Cascade Network for Salient Object Detection
Li, Xin
Yang, Fan
Cheng, Hong
Chen, Junyu
Guo, Yuxiao
Chen, Leiting
[J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 439 - 447
[47] Weakly supervised salient object detection via double object proposals guidance
Zhou, Zhiheng
Guo, Yongfan
Dai, Ming
Huang, Junchu
Li, Xiangwei
[J]. IET IMAGE PROCESSING, 2021, 15 (09) : 1957 - 1970
[48] Salient object detection via contrast information and object vision organization cues
Qi, Shengxiang
Yu, Jin-Gang
Ma, Jie
Li, Yansheng
Tian, Jinwen
[J]. NEUROCOMPUTING, 2015, 167 : 390 - 405
[49] DHNet: Salient Object Detection With Dynamic Scale-Aware Learning and Hard-Sample Refinement
Zhang, Chenhao
Gao, Shanshan
Mao, Deqian
Zhou, Yuanfeng
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7772 - 7782
[50] A novel dynamic graph evolution network for salient object detection
Mingzhu Xu
Ping Fu
Bing Liu
Hongtao Yin
Junbao Li
[J]. Applied Intelligence, 2022, 52 : 2854 - 2871

← 1 2 3 4 5 →