Depth-Guided Progressive Network for Object Detection

Cited by: 2
Authors
Ma, Jia-Wei [1, 2]
Liang, Min [2]
Chen, Song-Lu [1, 2]
Chen, Feng [3]
Tian, Shu [2]
Qin, Jingyan [4]
Yin, Xu-Cheng [1, 2]
Affiliations
[1] Univ Sci & Technol Beijing, USTB EEasyTech Joint Lab Artificial Intelligence, Beijing 100083, Peoples R China
[2] Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Beijing 100083, Peoples R China
[3] EEasy Technol Co Ltd, Zhuhai 519000, Peoples R China
[4] Univ Sci & Technol Beijing, Dept Ind Design, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Feature extraction; Object detection; Detectors; Interference; Signal to noise ratio; Semantics; Location awareness; multi-scale object; depth-guided; progressive sampling;
DOI
10.1109/TITS.2022.3156365
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
Multi-scale object detection in natural scenes is still challenging. To enhance multi-scale perception, some algorithms combine lower-level and higher-level information via multi-scale feature fusion strategies. However, these approaches ignore the inherent spatial properties among instances and the relations between foreground and background. In addition, the human-defined "center-based" regression quality evaluation strategy, which predicts a high-to-low score based on a linear relationship with the distance to the center of the ground-truth box, is not robust to scale-variant objects. In this work, we propose a Depth-Guided Progressive Network (DGPNet) for multi-scale object detection. Specifically, besides predicting classification and localization, the network estimates depth and uses it to guide the image features in a weighted manner to obtain a better spatial representation. Depth estimation and 2D object detection are therefore learned simultaneously in a unified network, where the depth features are merged as auxiliary information into the detection branch to enhance the discrimination among multi-scale objects. Moreover, to overcome the difficulty of empirically fitting the localization quality function, high-quality predicted boxes on scale-variant objects are obtained more adaptively by an IoU-aware progressive sampling strategy. We divide the sampling process into two stages, i.e., "statistical-aware" and "IoU-aware". The former selects thresholds for positive samples based on statistical characteristics of multi-scale instances, and the latter further selects high-quality samples by IoU from the candidates kept by the former. As a result, the final ranking scores better reflect the quality of localization. Experiments verify that our method outperforms state-of-the-art methods on the KINS and Cityscapes datasets.
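The two-stage sampling idea described in the abstract can be illustrated with a minimal Python sketch. The abstract does not say which statistics the first stage uses or how the second stage applies IoU, so everything concrete here is an assumption for illustration: the mean-plus-standard-deviation threshold, the fixed `iou_keep` cutoff, and the names `iou` and `progressive_sample` are hypothetical and are not taken from the paper.

```python
from statistics import mean, pstdev


def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def progressive_sample(gt_box, candidates, predictions, iou_keep=0.5):
    """Return (stage1, stage2) index lists for one ground-truth box.

    candidates[i] is the i-th candidate box and predictions[i] is the box
    regressed from it. Both stages below are illustrative assumptions.
    """
    # Stage 1 ("statistical-aware"): derive a per-instance threshold from
    # the candidate IoU statistics. ASSUMPTION: mean + population std.
    cand_ious = [iou(gt_box, c) for c in candidates]
    threshold = mean(cand_ious) + pstdev(cand_ious)
    stage1 = [i for i, v in enumerate(cand_ious) if v >= threshold]

    # Stage 2 ("IoU-aware"): among stage-1 positives, keep only those whose
    # predicted box overlaps the ground truth well. ASSUMPTION: fixed cutoff.
    stage2 = [i for i in stage1 if iou(gt_box, predictions[i]) >= iou_keep]
    return stage1, stage2


# Toy data: two well-placed candidates, three far away from the object.
gt = (0, 0, 10, 10)
cands = [(0, 0, 10, 10), (0, 0, 10, 9), (20, 20, 30, 30),
         (30, 30, 40, 40), (40, 40, 50, 50)]
# The second candidate regresses to a poorly localized box.
preds = [(0, 0, 10, 10), (5, 5, 15, 15), (20, 20, 30, 30),
         (30, 30, 40, 40), (40, 40, 50, 50)]
stage1, stage2 = progressive_sample(gt, cands, preds)
print(stage1, stage2)
```

In this toy case the first stage keeps both well-placed candidates, and the second stage drops the one whose predicted box drifts away from the ground truth, which is the sense in which the surviving samples better reflect localization quality.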
Pages: 19523-19533
Number of pages: 11