COMAL: compositional multi-scale feature enhanced learning for crowd counting

被引:8
|
作者
Zhou, Fangbo [1 ]
Zhao, Huailin [1 ]
Zhang, Yani [2 ]
Zhang, Qing [2 ]
Liang, Lanjun [1 ]
Li, Yaoyao [1 ]
Duan, Zuodong [3 ]
机构
[1] Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai, Peoples R China
[2] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai, Peoples R China
[3] Beijing Inst Technol, Sch Mechatron Engn, Sci & Technol Electromech Dynam Control Lab, Beijing, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金;
关键词
Crowd counting; Crowd density estimation; Convolutional neural network; Multi-scale feature learning;
D O I
10.1007/s11042-022-12249-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurately modeling the crowd's head scale variations is an effective way to improve the counting accuracy of the crowd counting methods. Most counting networks apply a multi-branch network structure to obtain different scales of head features. Although they have achieved promising results, they do not perform very well on the extreme scale variation scene due to the limited scale representability. Meanwhile, these methods are prone to recognize background objects as foreground crowds in complex scenes due to the limited context and high-level semantic information. We propose a compositional multi-scale feature enhanced learning approach (COMAL) for crowd counting to handle the above limitations. COMAL enhances the multi-scale feature representations from three aspects: (1) The semantic enhanced module (SEM) is developed for embedding the high-level semantic information to the multi-scale features; (2) The diversity enhanced module (DEM) is proposed to enrich the variety of crowd features' different scales; (3) The context enhanced module (CEM) is designed for strengthening the multi-scale features with more context information. Based on the proposed COMAL, we develop a crowd counting network under the encoder-decoder framework and perform extensive experiments on ShanghaiTech, UCF_CC_50, and UCF-QNRF datasets. Qualitative and quantitive results demonstrate the effectiveness of the proposed COMAL.
引用
收藏
页码:20541 / 20560
页数:20
相关论文
共 50 条
  • [1] COMAL: compositional multi-scale feature enhanced learning for crowd counting
    Fangbo Zhou
    Huailin Zhao
    Yani Zhang
    Qing Zhang
    Lanjun Liang
    Yaoyao Li
    Zuodong Duan
    Multimedia Tools and Applications, 2022, 81 : 20541 - 20560
  • [2] Multi-scale Feature Aggregation for Crowd Counting
    Jiang, Xiaoheng
    Wu, Xinyi
    Cholakkal, Hisham
    Anwer, Rao Muhammad
    Cao, Jiale
    Xu, Mingliang
    Zhou, Bing
    Pang, Yanwei
    Khan, Fahad Shahbaz
    arXiv, 2022,
  • [3] Crowd Counting based on Multi-level Multi-scale Feature
    Di Wu
    Zheyi Fan
    Shuhan Yi
    Applied Intelligence, 2023, 53 : 21891 - 21901
  • [4] Crowd Counting based on Multi-level Multi-scale Feature
    Wu, Di
    Fan, Zheyi
    Yi, Shuhan
    APPLIED INTELLIGENCE, 2023, 53 (19) : 21891 - 21901
  • [5] Double multi-scale feature fusion network for crowd counting
    Liu, Qian
    Fang, Jiongtao
    Zhong, Yixiong
    Wang, Cunbao
    Qi, Youwei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (34) : 81831 - 81855
  • [6] Crowd Counting Method Based on Multi-Scale Enhanced Network
    Xu Tao
    Duan Yinong
    Du Jiahao
    Liu Caihua
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (06) : 1764 - 1771
  • [7] A multi-scale and multi-level feature aggregation network for crowd counting
    Zhu, Fushun
    Yan, Hua
    Chen, Xinyue
    Li, Tong
    Zhang, Zhengyu
    NEUROCOMPUTING, 2021, 423 : 46 - 56
  • [8] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Donghua Liu
    Guodong Wang
    Guangtao Zhai
    Multimedia Tools and Applications, 2022, 81 : 37939 - 37952
  • [9] Multi-scale dilated convolution of feature Fusion Network for Crowd counting
    Liu, Donghua
    Wang, Guodong
    Zhai, Guangtao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37939 - 37952
  • [10] MSFFA: a multi-scale feature fusion and attention mechanism network for crowd counting
    Li, Zhaoxin
    Lu, Shuhua
    Dong, Yishan
    Guo, Jingyuan
    VISUAL COMPUTER, 2023, 39 (03): : 1045 - 1056