Density Map Regression Guided Detection Network for RGB-D Crowd Counting and Localization

Cited by: 142
Authors
Lian, Dongze [1]
Li, Jing [1]
Zheng, Jia [1]
Luo, Weixin [1, 2]
Gao, Shenghua [1]
Affiliations
[1] ShanghaiTech University, Shanghai, People's Republic of China
[2] Yoke Intelligence, Copenhagen, Denmark
DOI
10.1109/CVPR.2019.00192
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To simultaneously estimate head counts and localize heads with bounding boxes, a regression guided detection network (RDNet) is proposed for RGB-D crowd counting. Specifically, to improve the robustness of detection-based approaches for small/tiny heads, we leverage the density map to improve head/non-head classification in the detection network, where the density map serves as the probability of a pixel being a head. A depth-adaptive kernel that accounts for the variance in head sizes is also introduced to generate high-fidelity density maps for more robust density map regression. Further, a depth-aware anchor is designed for better initialization of anchor sizes in the detection framework. We then train RDNet with bounding boxes whose sizes are estimated from depth. Because the existing RGB-D datasets are too small to evaluate data-driven approaches, we collect a large-scale RGB-D crowd counting dataset. Experiments on both our RGB-D dataset and the MICC RGB-D counting dataset show that our method achieves the best performance for RGB-D crowd counting and localization. Further, our method can be readily extended to RGB image based crowd counting and achieves comparable performance on the ShanghaiTech Part_B dataset for both counting and localization.
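The abstract does not state the form of the depth-adaptive kernel, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a Gaussian kernel whose standard deviation is inversely proportional to the depth at each annotated head (closer heads look larger, so they get a wider kernel). The function name `depth_adaptive_density_map` and the parameters `beta` and `min_sigma` are hypothetical and chosen for illustration.

```python
import math


def depth_adaptive_density_map(heads, depth, height, width,
                               beta=4.0, min_sigma=0.5):
    """Build a density map from head points and a depth image.

    heads: list of (row, col) head annotations.
    depth: 2D list (height x width) of depths, e.g. in meters.
    beta, min_sigma: hypothetical constants, not taken from the paper.
    """
    density = [[0.0] * width for _ in range(height)]
    for r, c in heads:
        d = depth[r][c]
        # Assumption: kernel width shrinks as the head gets farther away.
        sigma = max(beta / d, min_sigma) if d > 0 else min_sigma
        radius = int(math.ceil(3 * sigma))
        # Gaussian weights for the pixels of the window inside the image.
        weights = {}
        for y in range(max(0, r - radius), min(height, r + radius + 1)):
            for x in range(max(0, c - radius), min(width, c + radius + 1)):
                dist2 = (y - r) ** 2 + (x - c) ** 2
                weights[(y, x)] = math.exp(-dist2 / (2 * sigma ** 2))
        # Normalize so that each head contributes exactly 1 to the total.
        total = sum(weights.values())
        for (y, x), w in weights.items():
            density[y][x] += w / total
    return density


if __name__ == "__main__":
    # Toy scene: the upper half is near (2 m), the lower half is far (8 m).
    toy_depth = [[2.0 if r < 16 else 8.0 for _ in range(40)]
                 for r in range(32)]
    dm = depth_adaptive_density_map([(8, 10), (24, 30)], toy_depth, 32, 40)
    print("estimated count:", sum(map(sum, dm)))
```

Because each kernel is normalized over the pixels that fall inside the image, summing the map recovers the head count, which is the property that density map regression relies on. The near head is spread over a wider area with a lower peak, and the far head is concentrated in a few pixels.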
Pages: 1821-1830
Page count: 10
Related papers
50 in total
  • [1] Crowd Counting and Localization Beyond Density Map
    Khan, Akbar
    Kadir, Kushsairy
    Nasir, Haidawati
    Shah, Jawad Ali
    Albattah, Waleed
    Khan, Sheroz
    Kakakhel, Muhammad Haris
    IEEE ACCESS, 2022, 10 : 133142 - 133151
  • [2] CCANet: A Collaborative Cross-Modal Attention Network for RGB-D Crowd Counting
    Liu, Yanbo
    Cao, Guo
    Shi, Boshan
    Hu, Yingxiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 154 - 165
  • [3] Fully Convolutional Network for Crowd Size Estimation by Density Map and Counting Regression
    Wu, Bing-Fei
    Lin, Chun-Hsien
    2018 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2018, : 2170 - 2175
  • [4] REAL-TIME ACCURATE CROWD COUNTING BASED ON RGB-D INFORMATION
    Fu, Huiyuan
    Ma, Huadong
    Xiao, Hongtian
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 2685 - 2688
  • [5] An RGB-D Semantic Map Building and Global Localization Method
    Fu, Tianqi
    Tian, Facun
    Ma, Lei
    Sun, Yongkui
    2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 979 - 984
  • [6] Perceptual localization and focus refinement network for RGB-D salient object detection
    Han, Jinyu
    Wang, Mengyin
    Wu, Weiyi
    Jia, Xu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [7] A point and density map hybrid network for crowd counting and localization based on unmanned aerial vehicles
    Zhao, Lei
    Bao, Zhengwei
    Xie, Zhijun
    Huang, Guangyan
    Rehman, Zeeshan Ur
    CONNECTION SCIENCE, 2022, 34 (01) : 2481 - 2499
  • [8] Multi-density map fusion network for crowd counting
    Wang, Yongjie
    Zhang, Wei
    Liu, Yanyan
    Zhu, Jianghua
    NEUROCOMPUTING, 2020, 397 : 31 - 38
  • [9] Correlation-attention guided regression network for efficient crowd counting
    Zeng, Xin
    Wang, Huake
    Guo, Qiang
    Wu, Yunpeng
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 99
  • [10] RGB-D Map for Robot Navigation
    Duchon, Frantisek
    Toelgyessy, Michal
    Chovanec, L'ubos
    Paszto, Peter
    Babinec, Andrej
    Gardian, Pavol
    2014 ELEKTRO, 2014, : 154 - 158