Salient Object Detection via Dynamic Scale Routing

被引:21
|
作者
Wu, Zhenyu [1 ]
Li, Shuai [1 ,2 ]
Chen, Chenglizhao [1 ]
Qin, Hong [3 ]
Hao, Aimin [1 ,2 ,4 ]
机构
[1] Beihang Univ, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China
[3] SUNY Stony Brook, Dept Comp Sci, Stony Brook, NY 11794 USA
[4] Chinese Acad Med Sci, Res Unit Virtual Human & Virtual Surg 2019RU004, Beijing 100006, Peoples R China
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Kernel; Decoding; Feature extraction; Routing; Deep learning; Object detection; Technological innovation; Dynamic scale routing; scale-aware feature aggregation; salient object detection; NETWORK; FUSION;
D O I
10.1109/TIP.2022.3214332
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent research advances in salient object detection (SOD) could largely be attributed to ever-stronger multi-scale feature representation empowered by the deep learning technologies. The existing SOD deep models extract multi-scale features via the off-the-shelf encoders and combine them smartly via various delicate decoders. However, the kernel sizes in this commonly-used thread are usually "fixed". In our new experiments, we have observed that kernels of small size are preferable in scenarios containing tiny salient objects. In contrast, large kernel sizes could perform better for images with large salient objects. Inspired by this observation, we advocate the "dynamic" scale routing (as a brand-new idea) in this paper. It will result in a generic plug-in that could directly fit the existing feature backbone. This paper's key technical innovations are two-fold. First, instead of using the vanilla convolution with fixed kernel sizes for the encoder design, we propose the dynamic pyramid convolution (DPConv), which dynamically selects the best-suited kernel sizes w.r.t. the given input. Second, we provide a self-adaptive bidirectional decoder design to accommodate the DPConv-based encoder best. The most significant highlight is its capability of routing between feature scales and their dynamic collection, making the inference process scale-aware. As a result, this paper continues to enhance the current SOTA performance. Both the code and dataset are publicly available at https://github.com/wuzhenyubuaa/DPNet.
引用
收藏
页码:6649 / 6663
页数:15
相关论文
共 50 条
  • [41] Multi-scale Interactive Network for Salient Object Detection
    Pang, Youwei
    Zhao, Xiaoqi
    Zhang, Lihe
    Lu, Huchuan
    [J]. arXiv, 2020,
  • [42] Salient Object Detection with CNNs and Multi-scale CRFs
    Xu, Yingyue
    Hong, Xiaopeng
    Zhao, Guoying
    [J]. IMAGE ANALYSIS, 2019, 11482 : 233 - 245
  • [43] Salient object detection based on multi-scale contrast
    Wang, Hai
    Dai, Lei
    Cai, Yingfeng
    Sun, Xiaoqiang
    Chen, Long
    [J]. NEURAL NETWORKS, 2018, 101 : 47 - 56
  • [44] Salient object detection on large-scale video data
    Zhang, Shile
    Fan, Jianping
    Lu, Hong
    Xue, Xiangyang
    [J]. 2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 3704 - +
  • [45] SBN: Scale Balance Network for Accurate Salient Object Detection
    Tan, Zhenshan
    Gu, Xiaodong
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [46] Multi-Scale Cascade Network for Salient Object Detection
    Li, Xin
    Yang, Fan
    Cheng, Hong
    Chen, Junyu
    Guo, Yuxiao
    Chen, Leiting
    [J]. PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 439 - 447
  • [47] Weakly supervised salient object detection via double object proposals guidance
    Zhou, Zhiheng
    Guo, Yongfan
    Dai, Ming
    Huang, Junchu
    Li, Xiangwei
    [J]. IET IMAGE PROCESSING, 2021, 15 (09) : 1957 - 1970
  • [48] Salient object detection via contrast information and object vision organization cues
    Qi, Shengxiang
    Yu, Jin-Gang
    Ma, Jie
    Li, Yansheng
    Tian, Jinwen
    [J]. NEUROCOMPUTING, 2015, 167 : 390 - 405
  • [49] DHNet: Salient Object Detection With Dynamic Scale-Aware Learning and Hard-Sample Refinement
    Zhang, Chenhao
    Gao, Shanshan
    Mao, Deqian
    Zhou, Yuanfeng
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) : 7772 - 7782
  • [50] A novel dynamic graph evolution network for salient object detection
    Mingzhu Xu
    Ping Fu
    Bing Liu
    Hongtao Yin
    Junbao Li
    [J]. Applied Intelligence, 2022, 52 : 2854 - 2871