Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization

被引：142

作者：

Lian, Dongze ^{[1
]}

Li, Jing ^{[1
]}

Zheng, Jia ^{[1
]}

Luo, Weixin ^{[1
,2
]}

Gao, Shenghua ^{[1
]}

机构：

[1] ShanghaiTech Univ, Shanghai, Peoples R China

[2] Yoke Intelligence, Copenhagen, Denmark

来源：

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年

关键词：

D O I：

10.1109/CVPR.2019.00192

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

To simultaneously estimate head counts and localize heads with bounding boxes, a regression guided detection network (RDNet) is proposed for RGB-D crowd counting. Specifically, to improve the robustness of detection-based approaches for small/tiny heads, we leverage density map to improve the head/non-head classification in detection network where density map serves as the probability of a pixel being a head. A depth-adaptive kernel that considers the variances in head sizes is also introduced to generate high-fidelity density map for more robust density map regression. Further, a depth-aware anchor is designed for better initialization of anchor sizes in detection framework. Then we use the bounding boxes whose sizes are estimated with depth to train our RDNet. The existing RGB-D datasets are too small and not suitable for performance evaluation on data-driven based approaches, we collect a large-scale RGB-D crowd counting dataset. Experiments on both our RGB-D dataset and the MICC RGB-D counting dataset show that our method achieves the best performance for RGB-D crowd counting and localization. Further, our method can be readily extended to RGB image based crowd counting and achieves comparable performance on the Shang-haiTech Part_B dataset for both counting and localization.

引用

页码：1821 / 1830

页数：10

共 50 条

[31] Planes Detection for Robust Localization and Mapping in RGB-D SLAM systems
ElGhor, Hakim ElChaoui
Roussel, David
Ababsa, Fakhreddine
Bouyakhf, El Houssine
2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 452 - 459
[32] Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection
Wang, Ruimin
Wang, Fasheng
Su, Yiming
Sun, Jing
Sun, Fuming
Li, Haojie
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (03)
[33] Cross-modal refined adjacent-guided network for RGB-D salient object detection
Bi H.
Zhang J.
Wu R.
Tong Y.
Jin W.
Multimedia Tools Appl, 24 (37453-37478): : 37453 - 37478
[34] Dual attention guided multi-scale fusion network for RGB-D salient object detection
Gao, Huan
Guo, Jichang
Wang, Yudong
Dong, Jianan
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
[35] Crowd counting method via a dynamic-refined density map network
Liu, Yanbo
Cao, Guo
Ge, Zixian
Hu, Yingxiang
NEUROCOMPUTING, 2022, 497 : 191 - 203
[36] Building change detection with RGB-D map generated from UAV images
Chen, Baohua
Chen, Zhixiang
Deng, Lei
Duan, Yueqi
Zhou, Jie
NEUROCOMPUTING, 2016, 208 : 350 - 364
[37] Robust Localization Using RGB-D Images
Oh, Yoonseon
Oh, Songhwai
2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1023 - 1026
[38] Grid Map Guided Indoor 3D Reconstruction for Mobile Robots with RGB-D Sensors
Zhang, Boyu
Zhang, Xuebo
Chen, Xiang
Fang, Yongchun
2018 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2018, : 498 - 503
[39] DGT: Depth-guided RGB-D occluded target detection with transformers
Xu, Kelei
Wang, Chunyan
Zhao, Wanzhong
Liu, Jinqiang
APPLIED INTELLIGENCE, 2025, 55 (04)
[40] Foreground Mask Guided Network for Crowd Counting
Li, Chun
Shang, Lin
Xu, Suping
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 322 - 334

← 1 2 3 4 5 →