Temporal Extension for Encoder-Decoder-based Crowd Counting Approaches

被引:0
|
作者
Golda, Thomas [1 ,2 ]
Kruger, Florian [2 ]
Beyerer, Jurgen [1 ,2 ]
机构
[1] Karlsruhe Inst Technol KIT, Vis & Fus Lab, Karlsruhe, Germany
[2] Fraunhofer Inst Optron Syst Technol & Image Explo, Fraunhofer Ctr Machine Learning, Karlsruhe, Germany
关键词
D O I
10.23919/MVA51890.2021.9511351
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting is an important aspect to safety monitoring at mass events and can be used to initiate safety measures in time. State-of-the-art encoder-decoder architectures are able to estimate the number of people in a scene precisely. However, since most of the proposed methods are based to solely operate on single-image features, we observe that estimated counts for aerial video sequences are inherently noisy, which in turn reduces the significance of the overall estimates. In this paper, we propose a simple temporal extension to said encoder-decoder architectures that incorporates local context from multiple frames into the estimation process. By applying the temporal extension a state-of-the-art architectures and exploring multiple configuration settings, we find that the resulting estimates are more precise and smoother over time.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Attentive encoder-decoder networks for crowd counting
    Liu, Xuhui
    Hu, Yutao
    Zhang, Baochang
    Zhen, Xiantong
    Luo, Xiaoyan
    Cao, Xianbin
    [J]. NEUROCOMPUTING, 2022, 490 : 246 - 257
  • [2] Dense Encoder-Decoder-Based Architecture for Skin Lesion Segmentation
    Qamar, Saqib
    Ahmad, Parvez
    Shen, Linlin
    [J]. COGNITIVE COMPUTATION, 2021, 13 (02) : 583 - 594
  • [3] Automated Antenna Testing Using Encoder-Decoder-based Anomaly Detection
    Hsu, Hans Hao-Hsun
    Xu, Jiawen
    Sama, Ravi
    Kovatsch, Matthias
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 135 - 142
  • [4] An Encoder-Decoder-Based Super Resolution Network for License Plate Images
    Xu, Shengjun
    Deng, Bowen
    Shi, Ya
    Meng, Yuebo
    Liu, Guanghui
    Han, Jiuqiang
    [J]. Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2022, 56 (10): : 101 - 110
  • [5] Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks
    Jiang, Xiaolong
    Xiao, Zehao
    Zhang, Baochang
    Zhen, Xiantong
    Cao, Xianbin
    Doermann, David
    Shao, Ling
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 6126 - 6135
  • [6] Assessment of encoder-decoder-based segmentation models for thyroid ultrasound images
    Yadav, Niranjan
    Dass, Rajeshwar
    Virmani, Jitendra
    [J]. MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING, 2023, 61 (08) : 2159 - 2195
  • [7] Assessment of encoder-decoder-based segmentation models for thyroid ultrasound images
    Niranjan Yadav
    Rajeshwar Dass
    Jitendra Virmani
    [J]. Medical & Biological Engineering & Computing, 2023, 61 : 2159 - 2195
  • [8] Ship trajectory prediction using encoder-decoder-based deep learning models
    Duez, Buelent
    van Iperen, Erwin
    [J]. JOURNAL OF LOCATION BASED SERVICES, 2024,
  • [9] Curiosity Guided Fine-Tuning for Encoder-Decoder-Based Visual Forecasting
    Kamikawa, Yuta
    Hashimoto, Atsushi
    Sonogashira, Motoharu
    Iiyama, Masaaki
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (05) : 752 - 761
  • [10] An Effective Lightweight Crowd Counting Method Based on an Encoder-Decoder Network for Internet of Video Things
    Yi, Jun
    Chen, Fan
    Shen, Zhilong
    Xiang, Yi
    Xiao, Shan
    Zhou, Wei
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (02): : 3082 - 3094