InternDiffuseDet: Object Detection Method Combining Deformable Convolution and Diffusion Model

Authors
Yuan, Zhixiang [1]
Gao, Yongqi [1]
Affiliation
[1] School of Computer Science and Technology, Anhui University of Technology, Ma’anshan, Anhui 243032, China
Keywords
Convolution
DOI
10.3778/j.issn.1002-8331.2309-0272
Abstract
This paper addresses missed detections, limited feature extraction capability, and low detection accuracy in object detection for complex scenes. Building on DiffusionDet, it proposes an improved detector that combines deformable convolution with a diffusion model. The core idea is to increase both the quantity and the quality of the feature maps that enter the detection head. First, InternImage and the DCNv3 deformable convolution operator are introduced into the backbone network, enlarging the receptive field and strengthening the model's non-linear modeling capability. Second, an improved feature pyramid network based on selective weighting (CS-FPN) is proposed to enhance the intermediate FPN features: depth-wise separable convolutions separate channel and spatial processing, and the CARAFE operator replaces conventional upsampling to improve resolution and the transfer of semantic information. The SGE attention mechanism then reassembles the feature maps so that hierarchical information is preserved during diffusion. Before the detection head, a DDIM diffusion operation produces feature maps at different time steps, increasing the number of feature maps available for detection. Finally, EIoU is introduced into target box matching and the loss function to handle position deviations and scale differences between target boxes. Experiments on the COCO dataset and a road detection dataset show that, under the same experimental settings, the improved model outperforms the original model by 3.8 and 3.6 percentage points, respectively. These results suggest that the proposed method can improve the accuracy and robustness of object detection and offers a new approach to detection in real-world scenarios.
© 2024 Journal of Computer Engineering and Applications Beijing Co., Ltd.; Science Press. All rights reserved.
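The abstract names EIoU as the term used in box matching and in the loss, but does not spell it out. The sketch below is a minimal illustration of the commonly published EIoU formulation (1 - IoU, plus a normalized center-distance penalty, plus separate width and height penalties measured against the smallest enclosing box). It is an assumption that the paper uses this standard form; the function name `eiou_loss`, the `(x1, y1, x2, y2)` box layout, and the `eps` stabilizer are hypothetical choices made for this example only.

```python
def eiou_loss(pred, target, eps=1e-9):
    """EIoU loss for two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative sketch of the commonly published formulation;
    not confirmed to match the paper's implementation.
    """
    px1, py1, px2, py2 = pred
    tx1, ty1, tx2, ty2 = target

    # Widths and heights of both boxes.
    pw, ph = px2 - px1, py2 - py1
    tw, th = tx2 - tx1, ty2 - ty1

    # Intersection-over-union.
    iw = max(0.0, min(px2, tx2) - max(px1, tx1))
    ih = max(0.0, min(py2, ty2) - max(py1, ty1))
    inter = iw * ih
    union = pw * ph + tw * th - inter
    iou = inter / (union + eps)

    # Width and height of the smallest box enclosing both inputs.
    cw = max(px2, tx2) - min(px1, tx1)
    ch = max(py2, ty2) - min(py1, ty1)

    # Position deviation: squared center distance over squared enclosing diagonal.
    dx = (px1 + px2) / 2 - (tx1 + tx2) / 2
    dy = (py1 + py2) / 2 - (ty1 + ty2) / 2
    center_term = (dx * dx + dy * dy) / (cw * cw + ch * ch + eps)

    # Scale difference: width and height gaps, each normalized separately.
    width_term = (pw - tw) ** 2 / (cw * cw + eps)
    height_term = (ph - th) ** 2 / (ch * ch + eps)

    return 1.0 - iou + center_term + width_term + height_term
```

Calling `eiou_loss((0, 0, 2, 2), (1, 1, 3, 3))` penalizes both the reduced overlap and the shifted center, while the width and height terms are zero because the two boxes have the same size. Treating width and height as separate terms is what lets the loss respond directly to scale differences between boxes.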
Pages: 203-215