Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization

被引:142
|
作者
Lian, Dongze [1 ]
Li, Jing [1 ]
Zheng, Jia [1 ]
Luo, Weixin [1 ,2 ]
Gao, Shenghua [1 ]
机构
[1] ShanghaiTech Univ, Shanghai, Peoples R China
[2] Yoke Intelligence, Copenhagen, Denmark
关键词
D O I
10.1109/CVPR.2019.00192
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To simultaneously estimate head counts and localize heads with bounding boxes, a regression guided detection network (RDNet) is proposed for RGB-D crowd counting. Specifically, to improve the robustness of detection-based approaches for small/tiny heads, we leverage density map to improve the head/non-head classification in detection network where density map serves as the probability of a pixel being a head. A depth-adaptive kernel that considers the variances in head sizes is also introduced to generate high-fidelity density map for more robust density map regression. Further, a depth-aware anchor is designed for better initialization of anchor sizes in detection framework. Then we use the bounding boxes whose sizes are estimated with depth to train our RDNet. The existing RGB-D datasets are too small and not suitable for performance evaluation on data-driven based approaches, we collect a large-scale RGB-D crowd counting dataset. Experiments on both our RGB-D dataset and the MICC RGB-D counting dataset show that our method achieves the best performance for RGB-D crowd counting and localization. Further, our method can be readily extended to RGB image based crowd counting and achieves comparable performance on the Shang-haiTech Part_B dataset for both counting and localization.
引用
收藏
页码:1821 / 1830
页数:10
相关论文
共 50 条
  • [31] Planes Detection for Robust Localization and Mapping in RGB-D SLAM systems
    ElGhor, Hakim ElChaoui
    Roussel, David
    Ababsa, Fakhreddine
    Bouyakhf, El Houssine
    2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, : 452 - 459
  • [32] Attention-guided Multi-modality Interaction Network for RGB-D Salient Object Detection
    Wang, Ruimin
    Wang, Fasheng
    Su, Yiming
    Sun, Jing
    Sun, Fuming
    Li, Haojie
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 20 (03)
  • [33] Cross-modal refined adjacent-guided network for RGB-D salient object detection
    Bi H.
    Zhang J.
    Wu R.
    Tong Y.
    Jin W.
    Multimedia Tools Appl, 24 (37453-37478): : 37453 - 37478
  • [34] Dual attention guided multi-scale fusion network for RGB-D salient object detection
    Gao, Huan
    Guo, Jichang
    Wang, Yudong
    Dong, Jianan
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2023, 118
  • [35] Crowd counting method via a dynamic-refined density map network
    Liu, Yanbo
    Cao, Guo
    Ge, Zixian
    Hu, Yingxiang
    NEUROCOMPUTING, 2022, 497 : 191 - 203
  • [36] Building change detection with RGB-D map generated from UAV images
    Chen, Baohua
    Chen, Zhixiang
    Deng, Lei
    Duan, Yueqi
    Zhou, Jie
    NEUROCOMPUTING, 2016, 208 : 350 - 364
  • [37] Robust Localization Using RGB-D Images
    Oh, Yoonseon
    Oh, Songhwai
    2014 14TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2014), 2014, : 1023 - 1026
  • [38] Grid Map Guided Indoor 3D Reconstruction for Mobile Robots with RGB-D Sensors
    Zhang, Boyu
    Zhang, Xuebo
    Chen, Xiang
    Fang, Yongchun
    2018 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2018, : 498 - 503
  • [39] DGT: Depth-guided RGB-D occluded target detection with transformers
    Xu, Kelei
    Wang, Chunyan
    Zhao, Wanzhong
    Liu, Jinqiang
    APPLIED INTELLIGENCE, 2025, 55 (04)
  • [40] Foreground Mask Guided Network for Crowd Counting
    Li, Chun
    Shang, Lin
    Xu, Suping
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 322 - 334