InternDiffuseDet: Object Detection Method Combining Deformable Convolution and Diffusion Model

被引:0
|
作者
Yuan, Zhixiang [1 ]
Gao, Yongqi [1 ]
机构
[1] School of Computer Science and Technology, Anhui University of Technology, Anhui, Ma’anshan,243032, China
关键词
Convolution;
D O I
10.3778/j.issn.1002-8331.2309-0272
中图分类号
学科分类号
摘要
The paper focuses on the topic of object detection and aims to address issues such as missed detections, limited feature extraction capability, and low detection accuracy in complex scenes. Building upon DiffusionDet, a modified approach is proposed that combines deformable convolutions and diffusion models for object detection. The core idea is to increase the quantity and quality of feature maps before entering the detection head. This is achieved by introducing InternImage and DCNv3 deformable convolution operators into the backbone network, enhancing the receptive field and non-linear modeling capability of the model. An improved feature pyramid network (CS-FPN) based on selective weighting is proposed to enhance the intermediate FPN feature pyramids. Channel and spatial separations are achieved using depth-wise separable convolutions, with the traditional upsampling operation being replaced by the CARAFE operator to improve resolution and semantic information transfer. Following that, the SGE attention mechanism is employed to reassemble the feature maps, ensuring the preservation of hierarchical information during diffusion. Prior to entering the detection head, the DDIM diffusion operation is performed to obtain feature maps at different time steps, thereby augmenting the quantity of detection feature maps. Finally, the EIOU algorithm is introduced in target box matching and loss functions to handle position deviations and scale differences between target boxes. Experimental results on the COCO dataset and road detection dataset demonstrate that the improved model is 3.8 and 3.6 percentage points higher than the original model, respectively, in the same experimental settings. These results indicate the potential of the proposed method to enhance the accuracy and robustness of object detection, providing new insights and approaches for addressing object detection challenges in real-world scenarios. © 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
引用
收藏
页码:203 / 215
相关论文
共 50 条
  • [1] A Novel Object Detection Method for Solid Waste Incorporating a Weighted Deformable Convolution
    Xu, Xiong
    Cheng, Tao
    Zhao, Beibei
    Wang, Chao
    Tong, Xiaohua
    Feng, Yongjiu
    Xie, Huan
    Jin, Yanmin
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2023, 89 (11): : 679 - 689
  • [2] APNet: Accurate Positioning Deformable Convolution for UAV Image Object Detection
    Zhang, Peiran
    Zhang, Guoxin
    Yang, Kuihe
    IEEE LATIN AMERICA TRANSACTIONS, 2024, 22 (04) : 304 - 311
  • [3] OBJECT DETECTION IN VHR IMAGE USING TRANSFER LEARNING WITH DEFORMABLE CONVOLUTION
    Cao, Zeyu
    Li, Xiaorun
    Zhao, Liaoying
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 326 - 329
  • [4] DA-FPN: Deformable Convolution and Feature Alignment for Object Detection
    Fu, Xiang
    Yuan, Zemin
    Yu, Tingjian
    Ge, Yun
    ELECTRONICS, 2023, 12 (06)
  • [5] Object Detection Based on Deformable Part Model
    Wei Lei
    Xu Zhiyong
    8TH INTERNATIONAL SYMPOSIUM ON ADVANCED OPTICAL MANUFACTURING AND TESTING TECHNOLOGY: OPTICAL TEST, MEASUREMENT TECHNOLOGY, AND EQUIPMENT, 2016, 9684
  • [6] The Fastest Deformable Part Model for Object Detection
    Yan, Junjie
    Lei, Zhen
    Wen, Longyin
    Li, Stan Z.
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2497 - 2504
  • [7] Object detection with location-aware deformable convolution and backward attention filtering
    Zhang, Chen
    Kim, Joohee
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9444 - 9453
  • [8] PATTERN ANALYSIS OF DEFORMABLE CONVOLUTION ON RETINANET WITH SEMANTIC FILTER MECHANISM FOR OBJECT DETECTION
    Zhu, Shengyu
    Zhang, Junping
    Guo, Qingle
    Zhong, Chongxiao
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 3071 - 3074
  • [9] Research on FDC YOLO v8 Underwater Biological Object Detection Method Improved by Deformable Convolution
    Yuan, Hongchun
    Li, Chunqiao
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2024, 55 (11): : 140 - 146
  • [10] Improved Deformable Convolution Method for Aircraft Object Detection in Flight Based on Feature Separation in Remote Sensing Images
    Yu, Lijian
    Zhi, Xiyang
    Hu, Jianming
    Zhang, Shuqing
    Niu, Ruize
    Zhang, Wei
    Jiang, Shikai
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8313 - 8323