COMAL: compositional multi-scale feature enhanced learning for crowd counting

被引:8
|
作者
Zhou, Fangbo [1 ]
Zhao, Huailin [1 ]
Zhang, Yani [2 ]
Zhang, Qing [2 ]
Liang, Lanjun [1 ]
Li, Yaoyao [1 ]
Duan, Zuodong [3 ]
机构
[1] Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai, Peoples R China
[2] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai, Peoples R China
[3] Beijing Inst Technol, Sch Mechatron Engn, Sci & Technol Electromech Dynam Control Lab, Beijing, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Crowd counting; Crowd density estimation; Convolutional neural network; Multi-scale feature learning;
D O I
10.1007/s11042-022-12249-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurately modeling the crowd's head scale variations is an effective way to improve the counting accuracy of the crowd counting methods. Most counting networks apply a multi-branch network structure to obtain different scales of head features. Although they have achieved promising results, they do not perform very well on the extreme scale variation scene due to the limited scale representability. Meanwhile, these methods are prone to recognize background objects as foreground crowds in complex scenes due to the limited context and high-level semantic information. We propose a compositional multi-scale feature enhanced learning approach (COMAL) for crowd counting to handle the above limitations. COMAL enhances the multi-scale feature representations from three aspects: (1) The semantic enhanced module (SEM) is developed for embedding the high-level semantic information to the multi-scale features; (2) The diversity enhanced module (DEM) is proposed to enrich the variety of crowd features' different scales; (3) The context enhanced module (CEM) is designed for strengthening the multi-scale features with more context information. Based on the proposed COMAL, we develop a crowd counting network under the encoder-decoder framework and perform extensive experiments on ShanghaiTech, UCF_CC_50, and UCF-QNRF datasets. Qualitative and quantitive results demonstrate the effectiveness of the proposed COMAL.
引用
收藏
页码:20541 / 20560
页数:20
相关论文
共 50 条
  • [21] Crowd Counting by Multi-Scale Dilated Convolution Networks
    Dong, Jingwei
    Zhao, Ziqi
    Wang, Tongxin
    ELECTRONICS, 2023, 12 (12)
  • [22] Multi-scale Generative Adversarial Networks for Crowd Counting
    Yang, Jianxing
    Zhou, Yuan
    Kung, Sun-Yuan
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3244 - 3249
  • [23] Redesigning Multi-Scale Neural Network for Crowd Counting
    Du, Zhipeng
    Shi, Miaojing
    Deng, Jiankang
    Zafeiriou, Stefanos
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 3664 - 3678
  • [24] Multi-Scale Guided Attention Network for Crowd Counting
    Li, Pengfei
    Zhang, Min
    Wan, Jian
    Jiang, Ming
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [25] Multi-scale Attention Recalibration Network for crowd counting
    Xie, Jinyang
    Pang, Chen
    Zheng, Yanjun
    Li, Liang
    Lyu, Chen
    Lyu, Lei
    Liu, Hong
    APPLIED SOFT COMPUTING, 2022, 117
  • [26] MULTI-SCALE CONVOLUTIONAL NEURAL NETWORKS FOR CROWD COUNTING
    Zeng, Lingke
    Xu, Xiangmin
    Cai, Bolun
    Qiu, Suo
    Zhang, Tong
    2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 465 - 469
  • [27] STOCHASTIC MULTI-SCALE AGGREGATION NETWORK FOR CROWD COUNTING
    Wang, Mingjie
    Cai, Hao
    Zhou, Jun
    Gong, Minglun
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 2008 - 2012
  • [28] Crowd counting via learning perspective for multi-scale multi-view Web images
    Shang, Chong
    Ai, Haizhou
    Yang, Yi
    FRONTIERS OF COMPUTER SCIENCE, 2019, 13 (03) : 579 - 587
  • [29] Crowd counting via learning perspective for multi-scale multi-view Web images
    Chong Shang
    Haizhou Ai
    Yi Yang
    Frontiers of Computer Science, 2019, 13 : 579 - 587
  • [30] Anti-Background Interference Crowd Counting Network Based on Multi-scale Feature Fusion
    Yu, Ying
    Li, Jianfei
    Qian, Jin
    Cai, Zhen
    Zhu, Zhiliang
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (10): : 915 - 927