Crowd Counting and Density Estimation by Trellis Encoder-Decoder Networks

被引:231
|
作者
Jiang, Xiaolong [1 ]
Xiao, Zehao [1 ]
Zhang, Baochang [3 ]
Zhen, Xiantong [4 ]
Cao, Xianbin [1 ,2 ]
Doermann, David [5 ]
Shao, Ling [4 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Minist Ind & Informat Technol China, Key Lab Adv Technol Near Space Informat Syst, Beijing, Peoples R China
[3] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing, Peoples R China
[4] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates
[5] SUNY Buffalo, Dept Comp Sci & Engn, Buffalo, NY USA
关键词
D O I
10.1109/CVPR.2019.00629
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Crowd counting has recently attracted increasing interest in computer vision but remains a challenging problem. In this paper, we propose a trellis encoder-decoder network (TEDnet) for crowd counting, which focuses on generating high-quality density estimation maps. The major contributions are four-fold. First, we develop a new trellis architecture that incorporates multiple decoding paths to hierarchically aggregate features at different encoding stages, which improves the representative capability of convolutional features for large variations in objects. Second, we employ dense skip connections interleaved across paths to facilitate sufficient multi-scale feature fusions, which also helps TEDnet to absorb the supervision information. Third, we propose a new combinatorial loss to enforce similarities in local coherence and spatial correlation between maps. By distributedly imposing this combinatorial loss on intermediate outputs, TEDnet can improve the back-propagation process and alleviate the gradient vanishing problem. Finally, on four widely-used benchmarks, our TEDnet achieves the best overall performance in terms of both density map quality and counting accuracy, with an improvement up to 14% in MAE metric. These results validate the effectiveness of TEDnet for crowd counting.
引用
收藏
页码:6126 / 6135
页数:10
相关论文
共 50 条
  • [31] Multimodal Encoder-Decoder Attention Networks for Visual Question Answering
    Chen, Chongqing
    Han, Dezhi
    Wang, Jun
    [J]. IEEE ACCESS, 2020, 8 : 35662 - 35671
  • [32] PottsMGNet: A Mathematical Explanation of Encoder-Decoder Based Neural Networks
    Tai, Xue-Cheng
    Liu, Hao
    Chan, Raymond
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2024, 17 (01): : 540 - 594
  • [33] Temporal Deformable Convolutional Encoder-Decoder Networks for Video Captioning
    Chen, Jingwen
    Pan, Yingwei
    Li, Yehao
    Yao, Ting
    Chao, Hongyang
    Mei, Tao
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 8167 - 8174
  • [34] Attention-based encoder-decoder networks for workflow recognition
    Min Zhang
    Haiyang Hu
    Zhongjin Li
    Jie Chen
    [J]. Multimedia Tools and Applications, 2021, 80 : 34973 - 34995
  • [35] EEG Channel Interpolation Using Deep Encoder-decoder Networks
    Saba-Sadiya, Sari
    Alhanai, Tuka
    Liu, Taosheng
    Ghassemi, Mohammad M.
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2432 - 2439
  • [36] Fetal electrocardiography extraction with residual convolutional encoder-decoder networks
    Zhong, Wei
    Liao, Lijuan
    Guo, Xuemei
    Wang, Guoli
    [J]. AUSTRALASIAN PHYSICAL & ENGINEERING SCIENCES IN MEDICINE, 2019, 42 (04) : 1081 - 1089
  • [37] SpeedER: A Supervised Encoder-Decoder Driven Engine for Effective Resistance Estimation of Power Delivery Networks
    Wu, Bing-Yue
    Fang, Shao-Yun
    Chang, Hsiang-Wen
    Wei, Peter
    [J]. MLCAD '22: PROCEEDINGS OF THE 2022 ACM/IEEE 4TH WORKSHOP ON MACHINE LEARNING FOR CAD (MLCAD), 2022, : 55 - 61
  • [38] Video to Text Study using an Encoder-Decoder Networks Approach
    Ismael Orozco, Carlos
    Elena Buemi, Maria
    Jacobo Berlles, Julio
    [J]. 2018 37TH INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY (SCCC), 2018,
  • [39] Deep encoder-decoder networks for belt longitudinal tear detection
    You, Lei
    Luo, Minghua
    Zhu, Xinglin
    Zhou, Bin
    [J]. MEASUREMENT & CONTROL, 2024,
  • [40] Attention-based encoder-decoder networks for state of charge estimation of lithium-ion battery
    Wu, Lifeng
    Zhang, Yu
    [J]. ENERGY, 2023, 268