Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks

被引:231
|
作者
Jiang, Xiaolong [1 ]
Xiao, Zehao [1 ]
Zhang, Baochang [3 ]
Zhen, Xiantong [4 ]
Cao, Xianbin [1 ,2 ]
Doermann, David [5 ]
Shao, Ling [4 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Minist Ind & Informat Technol China, Key Lab Adv Technol Near Space Informat Syst, Beijing, Peoples R China
[3] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[4] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[5] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY USA
关键词
D O I
10.1109/CVPR.2019.00629
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting has recently attracted increasing interest in computer vision but remains a challenging problem. In this paper, we propose a trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps. The major contributions are four-fold. First, we develop a new trellis architecture that incorporates multiple decoding paths to hierarchically aggregate features at different encoding stages, which improves the representative capability of convolutional features for large variations in objects. Second, we employ dense skip connections interleaved across paths to facilitate sufficient multi-scale feature fusions, which also helps TEDnet to absorb the supervision information. Third, we propose a new combinatorial loss to enforce similarities in local coherence and spatial correlation between maps. By distributedly imposing this combinatorial loss on intermediate outputs, TEDnet can improve the back-propagation process and alleviate the gradient vanishing problem. Finally, on four widely-used benchmarks, our TEDnet achieves the best overall performance in terms of both density map quality and counting accuracy, with an improvement up to 14% in MAE metric. These results validate the effectiveness of TEDnet for crowd counting.
引用
收藏
页码:6126 / 6135
页数:10
相关论文
共 50 条
  • [21] Comparison of Encoder-Decoder Networks for Soccer Field Segmentation
    Guimaraes, Otavio H. R.
    Maximo, Marcos R. O. A.
    Parente de Oliveira, Jose Maria
    [J]. 2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, : 496 - 501
  • [22] Temporal Extension for Encoder-Decoder-based Crowd Counting Approaches
    Golda, Thomas
    Kruger, Florian
    Beyerer, Jurgen
    [J]. PROCEEDINGS OF 17TH INTERNATIONAL CONFERENCE ON MACHINE VISION APPLICATIONS (MVA 2021), 2021,
  • [23] Variational Memory Encoder-Decoder
    Hung Le
    Truyen Tran
    Thin Nguyen
    Venkatesh, Svetha
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Are all shortcuts in encoder-decoder networks beneficial for CT denoising?
    Chen, Junhua
    Zhang, Chong
    Wee, Leonard
    Dekker, Andre
    Bermejo, Inigo
    [J]. COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (01): : 59 - 66
  • [25] Graph Regularized Encoder-Decoder Networks for Image Representation Learning
    Yang, Shijie
    Li, Liang
    Wang, Shuhui
    Zhang, Weigang
    Huang, Qingming
    Tian, Qi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 3124 - 3136
  • [26] Retrieval Augmented Convolutional Encoder-decoder Networks for Video Captioning
    Chen, Jingwen
    Pan, Yingwei
    Li, Yehao
    Yao, Ting
    Chao, Hongyang
    Mei, Tao
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [27] Wafer Pattern Counting, Detection and Classification Based on Encoder-Decoder CNN Structure
    Lin, Yu
    [J]. 2022 INTERMOUNTAIN ENGINEERING, TECHNOLOGY AND COMPUTING (IETC), 2022,
  • [28] Attention-based encoder-decoder networks for workflow recognition
    Zhang, Min
    Hu, Haiyang
    Li, Zhongjin
    Chen, Jie
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (28-29) : 34973 - 34995
  • [29] Video Summarization With Attention-Based Encoder-Decoder Networks
    Ji, Zhong
    Xiong, Kailin
    Pang, Yanwei
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (06) : 1709 - 1717
  • [30] PottsMGNet: A Mathematical Explanation of Encoder-Decoder Based Neural Networks
    Tai, Xue-Cheng
    Liu, Hao
    Chan, Raymond
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2024, 17 (01): : 540 - 594