Cross-Domain Object Detection Algorithm for Complex End-to-End Scene Understanding

被引:0
|
作者
Chen, Aoran [1 ]
Huang, Hai [1 ]
Zhu, Yueyan [1 ]
Xue, Junsheng [1 ]
机构
[1] School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing,100876, China
关键词
Computer vision - Convolutional neural networks - Image reconstruction - Multilayer neural networks - Object detection - Object recognition;
D O I
10.13190/j.jbupt.2023-285
中图分类号
学科分类号
摘要
Conventional deep learning training approaches often assume a similarity between the deployment scenario and the visual domain features present in the training data. However, this assumption might not hold true in complex end-to-end scenarios, making it difficult to meet the demands of intelligent detection services in open environments. In response, an object detection algorithm based on artificial intelligence closed-loop ensemble theory with cross-domain capabilities has been introduced. Within the detection framework, construct a backbone network and bottleneck layer network with multiscale convolutional layers. A visual domain discriminator featuring long-range dependency attention works as a secondary detection head to refine the results. Moreover, a background focusing module, based on spatial reconstruction attention units, is able to enhance learning focused on pseudo-background representations, thereby improving the accuracy of cross-domain object detection. Experimental results show that, compared to two-stage algorithms, the proposed algorithm yields an average precision increase 6.9%, and surpasses single-stage algorithms by 9.0% in complex end-to-end scenarios. © 2024 Beijing University of Posts and Telecommunications. All rights reserved.
引用
收藏
页码:57 / 62
相关论文
共 50 条
  • [41] Deeply Tensor Compressed Transformers for End-to-End Object Detection
    Zhen, Peining
    Gao, Ziyang
    Hou, Tianshu
    Cheng, Yuan
    Chen, Hai-Bao
    [J]. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4716 - 4724
  • [42] End-to-End Object Detection with Enhanced Positive Sample Filter
    Song, Xiaolin
    Chen, Binghui
    Li, Pengyu
    Wang, Biao
    Zhang, Honggang
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (03):
  • [43] Dynamic DETR: End-to-End Object Detection with Dynamic Attention
    Dai, Xiyang
    Chen, Yinpeng
    Yang, Jianwei
    Zhang, Pengchuan
    Yuan, Lu
    Zhang, Lei
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2968 - 2977
  • [44] Feature Fusion Pyramid Network for End-to-End Scene Text Detection
    Wu, Yirui
    Zhang, Lilai
    Li, Hao
    Zhang, Yunfei
    Wan, Shaohua
    [J]. ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 23 (11)
  • [45] Harmonious Teacher for Cross-domain Object Detection
    Deng, Jinhong
    Xu, Dongli
    Li, Wen
    Duan, Lixin
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23829 - 23838
  • [46] Cross-Domain Adaptive Teacher for Object Detection
    Li, Yu-Jhe
    Dai, Xiaoliang
    Ma, Chih-Yao
    Liu, Yen-Cheng
    Chen, Kan
    Wu, Bichen
    He, Zijian
    Kitani, Kris
    Vajda, Peter
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 7571 - 7580
  • [47] End-to-end Domain-Adversarial Voice Activity Detection
    Lavechin, Marvin
    Gill, Marie-Philippe
    Bousbib, Ruben
    Bredin, Herve
    Garcia-Perera, Leibny Paola
    [J]. INTERSPEECH 2020, 2020, : 3685 - 3689
  • [48] End-to-End Object Detection by Sparse R-CNN With Hybrid Matching in Complex Traffic Scenes
    Han, Xue-juan
    Qu, Zhong
    Wang, Shi-Yan
    Xia, Shu-Fang
    Wang, Sheng-Ye
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 512 - 525
  • [49] AN END-TO-END SCALABLE OBJECT DETECTION NETWORK FOR REMOTE SENSING IMAGES
    Duan, Yani
    Teng, Zhu
    Zhang, Baopeng
    Fan, Jianping
    [J]. IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 960 - 963
  • [50] An End-to-End Cascaded Image Deraining and Object Detection Neural Network
    Wang, Kaige
    Wang, Tianming
    Qu, Jianchuang
    Jiang, Huatao
    Li, Qing
    Chang, Lin
    [J]. IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) : 9541 - 9548