ORSI Salient Object Detection via Cross-Scale Interaction and Enlarged Receptive Field

被引:8
|
作者
Zheng, Jianwei [1 ]
Quan, Yueqian [1 ]
Zheng, Hang [1 ]
Wang, Yibin [1 ]
Pan, Xiang [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
基金
中国国家自然科学基金;
关键词
Power capacitors; Feature extraction; Semantics; Optical sensors; Shape; Optical imaging; Computer architecture; Cross-scale interaction; receptive field; remote sensing image (RSI); salient object detection (SOD); NETWORK;
D O I
10.1109/LGRS.2023.3249764
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Due to the diversity of scales and shapes, the uncertainty of object position, and the complexity of edge details, the recent merging problem of salient object detection in optical remote sensing image (RSI-SOD) is a considerably challenging topic. To cope with the challenges, we propose a new cross-scale interaction network (CIFNet) equipped with the enlarged receptive field, which mainly contains three modules in an encoder-decoder architecture, including a furcate skip connection module (FSCM), a global leading attention module, and an expansion-integration module (EIM). First, the FSCM uses dilated convolutions to enlarge the receptive field and furcate skip connections to capture more multiscale contextual information, both of which facilitate the adaptability of the model to different sizes, shapes, and quantities of the target objects. Second, on the low-resolution branch, a global leading attention module (GLM) locates the potentially significant object positions in the feature map from a global semantic perspective. Finally, through an attention-guided cascade structure, the EIM seeks more delicate characteristics by refining the features in a coarse-to-fine fashion. Extensive experiments are conducted on two RSI-SOD datasets, from which superior results can be achieved by our CIFNet, outperforming the other state-of-the-art methods. Compared with the second-best method, the performance gain of our method reaches 3.45% on mean absolute error (MAE) and 1.38% on F-beta(adp). Notably, the proposed CIF-Net runs with 40.40-M parameters, 14.8-GFLOPs computational complexity, and 58-frames/s inference speed, which guarantees high efficiency.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] CSSDet: small object detection via cross-scale feature enhancement on drone-view images
    Cheng, Gui
    Ding, Qing
    Cai, Bowen
    Dang, Chaoya
    Wang, Yu
    Zuo, Xiaolong
    Shao, Zhenfeng
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [22] Salient object detection via multi-scale attention CNN
    Ji, Yuzhu
    Zhang, Haijun
    Wu, Q. M. Jonathan
    NEUROCOMPUTING, 2018, 322 : 130 - 140
  • [23] CSEF-Net: Cross-Scale SAR Ship Detection Network Based on Efficient Receptive Field and Enhanced Hierarchical Fusion
    Zhang, Handan
    Wu, Yiquan
    REMOTE SENSING, 2024, 16 (04)
  • [24] Cross-Scale Feature Fusion for Object Detection in Optical Remote Sensing Images
    Cheng, Gong
    Si, Yongjie
    Hong, Hailong
    Yao, Xiwen
    Guo, Lei
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2021, 18 (03) : 431 - 435
  • [25] Dense Receptive Field for Object Detection
    Yao, Yongqiang
    Dong, Yuan
    Huang, Zesang
    Bai, Hongliang
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 1815 - 1820
  • [26] A YOLOX Object Detection Algorithm Based on Bidirectional Cross-scale Path Aggregation
    Liu, Qunpo
    Zhang, Jingwen
    Zhao, Yi
    Bu, Xuhui
    Hanajima, Naohiko
    NEURAL PROCESSING LETTERS, 2024, 56 (01)
  • [27] Learning Discriminative Representations From Cross-Scale Features for Camouflaged Object Detection
    Wang, Yongchao
    Bi, Xiuli
    Liu, Bo
    Wei, Yang
    Li, Weisheng
    Xiao, Bin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (12) : 12756 - 12769
  • [28] A YOLOX Object Detection Algorithm Based on Bidirectional Cross-scale Path Aggregation
    Qunpo Liu
    Jingwen Zhang
    Yi Zhao
    Xuhui Bu
    Naohiko Hanajima
    Neural Processing Letters, 56
  • [29] RCNet: Reverse Feature Pyramid and Cross-scale Shift Network for Object Detection
    Zong, Zhuofan
    Cao, Qianggang
    Leng, Biao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 5637 - 5645
  • [30] Cross-Stage Multi-Scale Interaction Network for RGB-D Salient Object Detection
    Yi, Kang
    Zhu, Jinchao
    Guo, Fu
    Xu, Jing
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2402 - 2406