Temporal Extension for Encoder-Decoder-based Crowd Counting Approaches

Cited by: 0
Authors
Golda, Thomas [1, 2]
Kruger, Florian [2]
Beyerer, Jurgen [1, 2]
Affiliations
[1] Karlsruhe Inst Technol KIT, Vis & Fus Lab, Karlsruhe, Germany
[2] Fraunhofer Inst Optron Syst Technol & Image Explo, Fraunhofer Ctr Machine Learning, Karlsruhe, Germany
DOI
10.23919/MVA51890.2021.9511351
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
081104; 0812; 0835; 1405
Abstract
Crowd counting is an important aspect of safety monitoring at mass events and can be used to initiate safety measures in time. State-of-the-art encoder-decoder architectures are able to estimate the number of people in a scene precisely. However, since most of the proposed methods operate solely on single-image features, we observe that estimated counts for aerial video sequences are inherently noisy, which in turn reduces the significance of the overall estimates. In this paper, we propose a simple temporal extension to said encoder-decoder architectures that incorporates local context from multiple frames into the estimation process. By applying the temporal extension to state-of-the-art architectures and exploring multiple configuration settings, we find that the resulting estimates are more precise and smoother over time.
Pages: 5