Feature refinement with multi-level context for object detection

被引:4
|
作者
Ma, Yingdong [1 ]
Wang, Yanan [1 ]
机构
[1] Inner Mongolia Univ, Coll Comp Sci, 235 West Daxue Rd, Hohhot, Peoples R China
关键词
Feature refinement; Cross-pooling; Global context; Object detection;
D O I
10.1007/s00138-023-01402-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robust multi-scale object detection is challenging as it requires both spatial details and semantic knowledge to deal with problems including high scale variation and cluttered background. Appropriate fusion of high-resolution features with deep semantic features is the key issue to achieve better performance. Different approaches have been developed to extract and combine deep features with shallow layer spatial features, such as feature pyramid network. However, high-resolution feature maps contain noisy and distractive features. Directly combines shallow features with semantic features might degrade detection accuracy. Besides, contextual information is also important for multi-scale object detection. In this work, we present a feature refinement scheme to tackle the feature fusion problem. The proposed feature refinement module increases feature resolution and refine feature maps progressively with the guidance from deep features. Meanwhile, we propose a context extraction method to capture global and local contextual information. The method utilizes a multi-level cross-pooling unit to extract global context and a cascaded context module to extract local context. The proposed object detection framework has been evaluated on PASCAL VOC and MS COCO datasets. Experimental results demonstrate that the proposed method performs favorably against state-of-the-art approaches.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Symbolic multi-level verification of refinement
    Hendricx, S
    Claesen, L
    [J]. NINTH GREAT LAKES SYMPOSIUM ON VLSI, PROCEEDINGS, 1999, : 288 - 291
  • [42] Multi-level consistency regularization for domain adaptive object detection
    Tian, Kun
    Zhang, Chenghao
    Wang, Ying
    Xiang, Shiming
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24): : 18003 - 18018
  • [43] Unsupervised Salient Object Detection by Aggregating Multi-Level Cues
    Xia, Chenxing
    Zhang, Hanling
    [J]. IEEE PHOTONICS JOURNAL, 2018, 10 (06):
  • [44] Multi-level Proposal Relations Aggregation for Video Object Detection
    Yu, Chongkai
    Chen, Wenjie
    Wu, Bing
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 734 - 745
  • [45] Deep Salient Object Detection by Integrating Multi-level Cues
    Zhang, Jing
    Dai, Yuchao
    Porikli, Fatih
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 1 - 10
  • [46] Multi-level consistency regularization for domain adaptive object detection
    Kun Tian
    Chenghao Zhang
    Ying Wang
    Shiming Xiang
    [J]. Neural Computing and Applications, 2023, 35 : 18003 - 18018
  • [47] Multi-level feature representations for video semantic concept detection
    Li, Haojie
    Liu, Lijuan
    Sun, Fuming
    Bao, Yu
    Liu, Chenxin
    [J]. NEUROCOMPUTING, 2016, 172 : 64 - 70
  • [48] Multi-branch feature fusion and refinement network for salient object detection
    Yang, Jinyu
    Shi, Yanjiao
    Zhang, Jin
    Guo, Qianqian
    Zhang, Qing
    Cui, Liu
    [J]. MULTIMEDIA SYSTEMS, 2024, 30 (04)
  • [49] Multi-stream feature refinement network for human object interaction detection
    Shao, Zhanpeng
    Hu, Zhongyan
    Yang, Jianyu
    Li, Youfu
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 86
  • [50] MFFNet: Single facial depth map refinement using multi-level feature fusion
    Zhang, Fan
    Liu, Na
    Hu, Yongli
    Duan, Fuqing
    [J]. SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, 103