Depth-Guided Progressive Network for Object Detection

被引：2

作者：

Ma, Jia-Wei ^{[1
,2
]}

Liang, Min ^{[2
]}

Chen, Song-Lu ^{[1
,2
]}

Chen, Feng ^{[3
]}

Tian, Shu ^{[2
]}

Qin, Jingyan ^{[4
]}

Yin, Xu-Cheng ^{[1
,2
]}

机构：

[1] Univ Sci & Technol Beijing, USTB EEasyTech Joint Lab Artificial Intelligence, Beijing 100083, Peoples R China

[2] Univ Sci & Technol Beijing, Dept Comp Sci & Technol, Beijing 100083, Peoples R China

[3] EEasy Technol Co Ltd, Zhuhai 519000, Peoples R China

[4] Univ Sci & Technol Beijing, Dept Ind Design, Beijing 100083, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2022年 / 23卷 / 10期

基金：

中国国家自然科学基金;

关键词：

Feature extraction; Object detection; Detectors; Interference; Signal to noise ratio; Semantics; Location awareness; multi-scale object; depth-guided; progressive sampling;

D O I：

10.1109/TITS.2022.3156365

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Multi-scale object detection in natural scenes is still challenging. To enhance the multi-scale perception capability, some algorithms combine the lower-level and higher-level information via multi-scale feature fusion strategies. However, the inherent spatial properties among instances and relations between foreground and background are ignored. In addition, the human-defined ``center-based'' regression quality evaluation strategy, predicting a high-to-low score based on a linear relationship with the distance to the center of ground-truth box, is not robust to scale-variant objects. In this work, we propose a Depth-Guided Progressive Network (DGPNet) for multi-scale object detection. Specifically, besides the prediction of classification and localization, the depth is estimated and used to guide the image features in a weighted manner to obtain a better spatial representation. Therefore, depth estimation and 2D object detection are simultaneously learned via a unified network, where the depth features are merged as auxiliary information into the detection branch to enhance the discrimination among multi-scale objects. Moreover, to overcome the difficulty of empirically fitting the localization quality function, high-quality predicted boxes on scale-variant objects are more adaptively obtained by an IoU-aware progressive sampling strategy. We divide the sampling process into two stages, i.e., ``statistical-aware'' and ``IoU-aware''. The former selects thresholds for positive samples based on statistical characteristics of multi-scale instances, and the latter further selects high-quality samples by IoU on the basis of the former. Therefore, the final ranking scores better reflect the quality of localization. Experiments verify that our method outperforms state-of-the-art methods on the KINS and Cityscapes dataset.

引用

页码：19523 / 19533

页数：11

共 50 条

[21] DXYW: a depth-guided multi-channel edge detection model
Chuan Lin
Qu Wang
Shujuan Wan
Signal, Image and Video Processing, 2023, 17 : 481 - 489
[22] Depth-Guided Dense Dynamic Filtering Network for Bokeh Effect Rendering
Purohit, Kuldeep
Suin, Maitreya
Kandula, Praveen
Ambasamudram, Rajagopalan
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3417 - 3426
[23] Depth-Guided Dehazing Network for Long-Range Aerial Scenes
Wang, Yihu
Zhao, Jilin
Yao, Liangliang
Fu, Changhong
REMOTE SENSING, 2024, 16 (12)
[24] DGT: Depth-guided RGB-D occluded target detection with transformers
Kelei Xu
Chunyan Wang
Wanzhong Zhao
Jinqiang Liu
Applied Intelligence, 2025, 55 (5)
[25] Depth-guided deep filtering network for efficient single image bokeh rendering
Quan Chen
Bolun Zheng
Xiaofei Zhou
Aiai Huang
Yaoqi Sun
Chuqiao Chen
Chenggang Yan
Shanxin Yuan
Neural Computing and Applications, 2023, 35 : 20869 - 20887
[26] Semantic attention and relative scene depth-guided network for underwater image enhancement
Chen, Tingkai
Wang, Ning
Chen, Yanzheng
Kong, Xiangjun
Lin, Yejin
Zhao, Hong
Karimi, Hamid Reza
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 123
[27] A Depth-Guided Attention Strategy for Crowd Counting
Chen, Hao
Li, Zhan
Bhanu, Bir
Lu, Dongping
Han, Xuming
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PART X, 2023, 14263 : 25 - 37
[28] Depth-guided deep filtering network for efficient single image bokeh rendering
Chen, Quan
Zheng, Bolun
Zhou, Xiaofei
Huang, Aiai
Sun, Yaoqi
Chen, Chuqiao
Yan, Chenggang
Yuan, Shanxin
NEURAL COMPUTING & APPLICATIONS, 2023, 35 (28): : 20869 - 20887
[29] Tracking Based Depth-guided Video Inpainting
Hatheele, Saroj
Zaveri, Mukesh A.
2013 FOURTH NATIONAL CONFERENCE ON COMPUTER VISION, PATTERN RECOGNITION, IMAGE PROCESSING AND GRAPHICS (NCVPRIPG), 2013,
[30] Unpaired Image Dehazing With Physical-Guided Restoration and Depth-Guided Refinement
Chen, Xiang
Li, Yufeng
Kong, Caihua
Dai, Longgang
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 587 - 591

← 1 2 3 4 5 →