ORSI Salient Object Detection via Cross-Scale Interaction and Enlarged Receptive Field

被引:8
|
作者
Zheng, Jianwei [1 ]
Quan, Yueqian [1 ]
Zheng, Hang [1 ]
Wang, Yibin [1 ]
Pan, Xiang [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
基金
中国国家自然科学基金;
关键词
Power capacitors; Feature extraction; Semantics; Optical sensors; Shape; Optical imaging; Computer architecture; Cross-scale interaction; receptive field; remote sensing image (RSI); salient object detection (SOD); NETWORK;
D O I
10.1109/LGRS.2023.3249764
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Due to the diversity of scales and shapes, the uncertainty of object position, and the complexity of edge details, the recent merging problem of salient object detection in optical remote sensing image (RSI-SOD) is a considerably challenging topic. To cope with the challenges, we propose a new cross-scale interaction network (CIFNet) equipped with the enlarged receptive field, which mainly contains three modules in an encoder-decoder architecture, including a furcate skip connection module (FSCM), a global leading attention module, and an expansion-integration module (EIM). First, the FSCM uses dilated convolutions to enlarge the receptive field and furcate skip connections to capture more multiscale contextual information, both of which facilitate the adaptability of the model to different sizes, shapes, and quantities of the target objects. Second, on the low-resolution branch, a global leading attention module (GLM) locates the potentially significant object positions in the feature map from a global semantic perspective. Finally, through an attention-guided cascade structure, the EIM seeks more delicate characteristics by refining the features in a coarse-to-fine fashion. Extensive experiments are conducted on two RSI-SOD datasets, from which superior results can be achieved by our CIFNet, outperforming the other state-of-the-art methods. Compared with the second-best method, the performance gain of our method reaches 3.45% on mean absolute error (MAE) and 1.38% on F-beta(adp). Notably, the proposed CIF-Net runs with 40.40-M parameters, 14.8-GFLOPs computational complexity, and 58-frames/s inference speed, which guarantees high efficiency.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Cross-scale resolution consistent network for salient object detection
    Huang, Xiaoyu
    Liu, Wei
    Li, Minghui
    Nie, Hangyu
    IET IMAGE PROCESSING, 2024, 18 (10) : 2788 - 2799
  • [2] Cross-Scale Edge Purification Network for salient object detection of steel defect images
    Ding, Tuo
    Li, Gongyang
    Liu, Zhi
    MEASUREMENT, 2022, 199
  • [3] ORSI Salient Object Detection via Multiscale Joint Region and Boundary Model
    Tu, Zhengzheng
    Wang, Chao
    Li, Chenglong
    Fan, Minghao
    Zhao, Haifeng
    Luo, Bin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [4] Cross-scale information enhancement for object detection
    Li, Tie-jun
    Zhao, Hui-feng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (33) : 79193 - 79206
  • [5] Global Guided Cross-Modal Cross-Scale Network for RGB-D Salient Object Detection
    Wang, Shuaihui
    Jiang, Fengyi
    Xu, Boqian
    SENSORS, 2023, 23 (16)
  • [6] DCIFPN: Deformable cross-scale interaction feature pyramid network for object detection
    Xiao, Junrui
    Jiang, He
    Li, Zhikai
    Gu, Qingyi
    IET IMAGE PROCESSING, 2023, 17 (09) : 2596 - 2610
  • [7] CSFFNet: Lightweight cross-scale feature fusion network for salient object detection in remote sensing images
    Wang, Longbao
    Long, Chong
    Li, Xin
    Tang, Xiaodan
    Bai, Zhipeng
    Gao, Hongmin
    IET IMAGE PROCESSING, 2024, 18 (03) : 602 - 614
  • [8] Weakly Supervised RGBT salient object detection via SAM-Guided Label Optimization and Progressive Cross-modal Cross-scale Fusion
    Zhai, Sulan
    Liu, Chengzhuang
    Tu, Zhengzheng
    Li, Chenglong
    Gao, Liuxuanqi
    INFORMATION FUSION, 2025, 120
  • [9] Cross-scale Dynamic Relation Network for Object Detection
    Zhong, Xinfang
    Li, Zhixin
    PRICAI 2023: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2024, 14325 : 327 - 339
  • [10] RFCS-YOLO: Target Detection Algorithm in Adverse Weather Conditions via Receptive Field Enhancement and Cross-Scale Fusion
    Liu, Gang
    Huang, Yingzheng
    Yan, Shuguang
    Hou, Enxiang
    SENSORS, 2025, 25 (03)