COMAL: compositional multi-scale feature enhanced learning for crowd counting

被引:0
|
作者
Fangbo Zhou
Huailin Zhao
Yani Zhang
Qing Zhang
Lanjun Liang
Yaoyao Li
Zuodong Duan
机构
[1] Shanghai Institute of Technology,School of Electrical and Electronic Engineering
[2] Shanghai Institute of Technology,School of Computer Science and Information Engineering
[3] Beijing Institute of Technology,Science and Technology on Electromechanical Dynamic Control Laboratory, School of Mechatronical Engineering
来源
关键词
Crowd counting; Crowd density estimation; Convolutional neural network; Multi-scale feature learning;
D O I
暂无
中图分类号
学科分类号
摘要
Accurately modeling the crowd’s head scale variations is an effective way to improve the counting accuracy of the crowd counting methods. Most counting networks apply a multi-branch network structure to obtain different scales of head features. Although they have achieved promising results, they do not perform very well on the extreme scale variation scene due to the limited scale representability. Meanwhile, these methods are prone to recognize background objects as foreground crowds in complex scenes due to the limited context and high-level semantic information. We propose a compositional multi-scale feature enhanced learning approach (COMAL) for crowd counting to handle the above limitations. COMAL enhances the multi-scale feature representations from three aspects: (1) The semantic enhanced module (SEM) is developed for embedding the high-level semantic information to the multi-scale features; (2) The diversity enhanced module (DEM) is proposed to enrich the variety of crowd features’ different scales; (3) The context enhanced module (CEM) is designed for strengthening the multi-scale features with more context information. Based on the proposed COMAL, we develop a crowd counting network under the encoder-decoder framework and perform extensive experiments on ShanghaiTech, UCF_CC_50, and UCF-QNRF datasets. Qualitative and quantitive results demonstrate the effectiveness of the proposed COMAL.
引用
收藏
页码:20541 / 20560
页数:19
相关论文
共 50 条
  • [31] MSR-FAN: Multi-scale residual feature-aware network for crowd counting
    Zhao, Haoyu
    Min, Weidong
    Wei, Xin
    Wang, Qi
    Fu, Qiyan
    Wei, Zitai
    IET IMAGE PROCESSING, 2021, 15 (14) : 3512 - 3521
  • [32] A Multi-Scale Feature Fusion Network With Cascaded Supervision for Cross-Scene Crowd Counting
    Zhang, Xinfeng
    Han, Lina
    Shan, Wencong
    Wang, Xiaohu
    Chen, Shuhan
    Zhu, Congcong
    Li, Bin
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [33] MULTI-STEP QUANTIZATION OF A MULTI-SCALE NETWORK FOR CROWD COUNTING
    Shim, Kyujin
    Byun, Junyoung
    Kim, Changick
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 683 - 687
  • [34] A multi-scale fusion and dual attention network for crowd counting
    De Zhang
    Yiting Wang
    Xiaoping Zhou
    Liangliang Su
    Multimedia Tools and Applications, 2025, 84 (13) : 11269 - 11294
  • [35] MSIANet: Multi-scale Interactive Attention Crowd Counting Network
    Zhang, Shihui
    Zhao, Weibo
    Wang, Lei
    Wang, Wei
    Li, Qunpeng
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 2236 - 2245
  • [36] Dense Crowd Counting Network Based on Multi-scale Perception
    Li, Hengchao
    Liu, Xianglian
    Liu, Peng
    Feng, Bin
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2024, 59 (05): : 1176 - 1183
  • [37] MHANet: Multi-scale hybrid attention network for crowd counting
    Yu, Ying
    Yu, Jiamao
    Qian, Jin
    Zhu, Zhiliang
    Han, Xing
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 9445 - 9455
  • [38] Compare and Focus: Multi-Scale View Aggregation for Crowd Counting
    Jiang, Shengqin
    Cai, Jialu
    Zhang, Haokui
    Liu, Yu
    Liu, Qingshan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 13231 - 13239
  • [39] Multi-Scale Network with Integrated Attention Unit for Crowd Counting
    Hafeezallah, Adel
    Al-Dhamari, Ahlam
    Abu-Bakar, Syed Abd Rahman
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3879 - 3903
  • [40] LEVERAGE MULTI-SCALE DILATED CONVOLUTIONAL NEURAL NETWORK WITH GLOBAL ATTENTION FEATURE FUSION FOR CROWD COUNTING
    Lv, Meilei
    Zhang, Kuncai
    Zheng, Xiaoyun
    Yang, W. E., I
    Lu, Zhe-Ming
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2022, 18 (04): : 1147 - 1162