Depth-Guided Progressive Network for Object Detection

被引:2
|
作者
Ma, Jia-Wei [1 ,2 ]
Liang, Min [2 ]
Chen, Song-Lu [1 ,2 ]
Chen, Feng [3 ]
Tian, Shu [2 ]
Qin, Jingyan [4 ]
Yin, Xu-Cheng [1 ,2 ]
机构
[1] Univ Sci & Technol Beijing, USTB EEasyTech Joint Lab Artificial Intelligence, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Beijing 100083, Peoples R China
[3] EEasy Technol Co Ltd, Zhuhai 519000, Peoples R China
[4] Univ Sci & Technol Beijing, Dept Ind Design, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Detectors; Interference; Signal to noise ratio; Semantics; Location awareness; multi-scale object; depth-guided; progressive sampling;
D O I
10.1109/TITS.2022.3156365
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Multi-scale object detection in natural scenes is still challenging. To enhance the multi-scale perception capability, some algorithms combine the lower-level and higher-level information via multi-scale feature fusion strategies. However, the inherent spatial properties among instances and relations between foreground and background are ignored. In addition, the human-defined ``center-based'' regression quality evaluation strategy, predicting a high-to-low score based on a linear relationship with the distance to the center of ground-truth box, is not robust to scale-variant objects. In this work, we propose a Depth-Guided Progressive Network (DGPNet) for multi-scale object detection. Specifically, besides the prediction of classification and localization, the depth is estimated and used to guide the image features in a weighted manner to obtain a better spatial representation. Therefore, depth estimation and 2D object detection are simultaneously learned via a unified network, where the depth features are merged as auxiliary information into the detection branch to enhance the discrimination among multi-scale objects. Moreover, to overcome the difficulty of empirically fitting the localization quality function, high-quality predicted boxes on scale-variant objects are more adaptively obtained by an IoU-aware progressive sampling strategy. We divide the sampling process into two stages, i.e., ``statistical-aware'' and ``IoU-aware''. The former selects thresholds for positive samples based on statistical characteristics of multi-scale instances, and the latter further selects high-quality samples by IoU on the basis of the former. Therefore, the final ranking scores better reflect the quality of localization. Experiments verify that our method outperforms state-of-the-art methods on the KINS and Cityscapes dataset.
引用
收藏
页码:19523 / 19533
页数:11
相关论文
共 50 条
  • [21] DXYW: a depth-guided multi-channel edge detection model
    Chuan Lin
    Qu Wang
    Shujuan Wan
    Signal, Image and Video Processing, 2023, 17 : 481 - 489
  • [22] Depth-Guided Dense Dynamic Filtering Network for Bokeh Effect Rendering
    Purohit, Kuldeep
    Suin, Maitreya
    Kandula, Praveen
    Ambasamudram, Rajagopalan
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3417 - 3426
  • [23] Depth-Guided Dehazing Network for Long-Range Aerial Scenes
    Wang, Yihu
    Zhao, Jilin
    Yao, Liangliang
    Fu, Changhong
    REMOTE SENSING, 2024, 16 (12)
  • [24] DGT: Depth-guided RGB-D occluded target detection with transformers
    Kelei Xu
    Chunyan Wang
    Wanzhong Zhao
    Jinqiang Liu
    Applied Intelligence, 2025, 55 (5)
  • [25] Depth-guided deep filtering network for efficient single image bokeh rendering
    Quan Chen
    Bolun Zheng
    Xiaofei Zhou
    Aiai Huang
    Yaoqi Sun
    Chuqiao Chen
    Chenggang Yan
    Shanxin Yuan
    Neural Computing and Applications, 2023, 35 : 20869 - 20887
  • [26] Semantic attention and relative scene depth-guided network for underwater image enhancement
    Chen, Tingkai
    Wang, Ning
    Chen, Yanzheng
    Kong, Xiangjun
    Lin, Yejin
    Zhao, Hong
    Karimi, Hamid Reza
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
  • [27] A Depth-Guided Attention Strategy for Crowd Counting
    Chen, Hao
    Li, Zhan
    Bhanu, Bir
    Lu, Dongping
    Han, Xuming
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 25 - 37
  • [28] Depth-guided deep filtering network for efficient single image bokeh rendering
    Chen, Quan
    Zheng, Bolun
    Zhou, Xiaofei
    Huang, Aiai
    Sun, Yaoqi
    Chen, Chuqiao
    Yan, Chenggang
    Yuan, Shanxin
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 20869 - 20887
  • [29] Tracking Based Depth-guided Video Inpainting
    Hatheele, Saroj
    Zaveri, Mukesh A.
    2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
  • [30] Unpaired Image Dehazing With Physical-Guided Restoration and Depth-Guided Refinement
    Chen, Xiang
    Li, Yufeng
    Kong, Caihua
    Dai, Longgang
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 587 - 591