A Vision Enhancement and Feature Fusion Multiscale Detection Network

被引:0
|
作者
Chengwu Qian
Jiangbo Qian
Chong Wang
Xulun Ye
Caiming Zhong
机构
[1] Ningbo University,
来源
关键词
Object detection; Object occlusion; Swin transformer; Vision enhancement;
D O I
暂无
中图分类号
学科分类号
摘要
In the field of object detection, there is often a high level of occlusion in real scenes, which can very easily interfere with the accuracy of the detector. Currently, most detectors use a convolutional neural network (CNN) as a backbone network, but the robustness of CNNs for detection under cover is poor, and the absence of object pixels makes conventional convolution ineffective in extracting features, leading to a decrease in detection accuracy. To address these two problems, we propose VFN (A Vision Enhancement and Feature Fusion Multiscale Detection Network), which first builds a multiscale backbone network using different stages of the Swin Transformer, and then utilizes a vision enhancement module using dilated convolution to enhance the vision of feature points at different scales and address the problem of missing pixels. Finally, the feature guidance module enables features at each scale to be enhanced by fusing with each other. The total accuracy demonstrated by VFN on both the PASCAL VOC dataset and the CrowdHuman dataset is better than that of other methods, and its ability to find occluded objects is also better, demonstrating the effectiveness of our method.The code is available at https://github.com/qcw666/vfn.
引用
收藏
相关论文
共 50 条
  • [21] MFFSODNet: Multiscale Feature Fusion Small Object Detection Network for UAV Aerial Images
    Jiang, Lingjie
    Yuan, Baoxi
    Du, Jiawei
    Chen, Boyu
    Xie, Hanfei
    Tian, Juan
    Yuan, Ziqi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 14
  • [22] Multiscale Feature Enhancement Network for Salient Object Detection in Optical Remote Sensing Images
    Wang, Zhen
    Guo, Jianxin
    Zhang, Chuanlei
    Wang, Buhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [23] Synthetic Aperture Radar Ship Detection Based on Efficient Multiscale Feature Enhancement Network
    Shan H.
    Liu W.
    Wang X.
    Hu Y.
    Duan X.
    Zhang Y.
    IEEE Transactions on Aerospace and Electronic Systems, 2024, 60 (06) : 1 - 16
  • [24] Multiscale Feature Interactive Network for Multifocus Image Fusion
    Liu, Yu
    Wang, Lei
    Cheng, Juan
    Chen, Xun
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
  • [25] Enhanced Multiscale Feature Fusion Network for HSI Classification
    Yang, Jiaqi
    Wu, Chen
    Du, Bo
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12): : 10328 - 10347
  • [26] Multiscale Feature Interactive Network for Multifocus Image Fusion
    Liu, Yu
    Wang, Lei
    Cheng, Juan
    Chen, Xun
    IEEE Transactions on Instrumentation and Measurement, 2021, 70
  • [27] Receptive field enhancement and attention feature fusion network for underwater object detection
    Xu, Huipu
    He, Zegang
    Chen, Shuo
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33007
  • [28] Pulmonary Nodule Detection Based on Multiscale Feature Fusion
    Zhao, Yue
    Wang, Zhongyang
    Liu, Xinyao
    Chen, Qi
    Li, Chuangang
    Zhao, Hongshuo
    Wang, Zhiqiong
    Computational and Mathematical Methods in Medicine, 2022, 2022
  • [29] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Lin, Xinyang
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2526 - 2538
  • [30] MFFENet: Multiscale Feature Fusion and Enhancement Network For RGB-Thermal Urban Road Scene Parsing
    Zhou, Wujie
    Lin, Xinyang
    Lei, Jingsheng
    Yu, Lu
    Hwang, Jenq-Neng
    IEEE Transactions on Multimedia, 2022, 24 : 2526 - 2538