TAFNet: A Three-Stream Adaptive Fusion Network for RGB-T Crowd Counting

Cited by: 24
Authors
Tang, Haihan [1]
Wang, Yi [1]
Chau, Lap-Pui [1]
Affiliations
[1] Nanyang Technological University, School of Electrical and Electronic Engineering, Singapore
Keywords
RGB-T; crowd counting; three-stream network
DOI
10.1109/ISCAS48785.2022.9937583
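Chinese Library Classification
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809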
Abstract
In this paper, we propose a three-stream adaptive fusion network named TAFNet, which uses paired RGB and thermal images for crowd counting. Specifically, TAFNet is divided into one main stream and two auxiliary streams. We combine a pair of RGB and thermal images to form the input of the main stream. The two auxiliary streams take the RGB image and the thermal image, respectively, to extract modality-specific features. In addition, we propose an Information Improvement Module (IIM) to adaptively fuse the modality-specific features into the main stream. Experimental results on the RGBT-CC dataset show that our method achieves more than 20% improvement in mean absolute error and root mean squared error compared with the state-of-the-art method. The source code will be publicly available at https://github.com/TANGHAIHAN/TAFNet.
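The abstract reports gains on two count-error metrics. As a minimal sketch of how such metrics are commonly computed over per-image crowd counts, and how a relative improvement over a baseline can be expressed, the following may help. The function names and all numbers are hypothetical illustrations, not values or code from the paper.

```python
import math

def mae(pred_counts, gt_counts):
    """Mean absolute error between predicted and ground-truth counts."""
    return sum(abs(p - g) for p, g in zip(pred_counts, gt_counts)) / len(gt_counts)

def rmse(pred_counts, gt_counts):
    """Root mean squared error between predicted and ground-truth counts."""
    squared = sum((p - g) ** 2 for p, g in zip(pred_counts, gt_counts))
    return math.sqrt(squared / len(gt_counts))

def relative_improvement(baseline_error, new_error):
    """Fractional error reduction relative to a baseline (0.25 means 25%)."""
    return (baseline_error - new_error) / baseline_error

# Hypothetical per-image counts, for illustration only.
predicted = [10, 20, 30]
ground_truth = [12, 18, 30]

print(mae(predicted, ground_truth))
print(rmse(predicted, ground_truth))
print(relative_improvement(20.0, 15.0))
```

Under this definition, a claim of "more than 20% improvement" corresponds to `relative_improvement` returning a value above 0.2 for both metrics.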
Pages: 3299-3303
Page count: 5
Related papers
50 in total
  • [1] Light-sensitive and adaptive fusion network for RGB-T crowd counting
    Huang, Liangjun
    Kang, Wencan
    Chen, Guangkai
    Zhang, Qing
    Zhang, Jianwei
    [J]. VISUAL COMPUTER, 2024, 40 (10): 7279-7292
  • [2] Spatial exchanging fusion network for RGB-T crowd counting
    Rao, Chaoqun
    Wan, Lin
    [J]. NEUROCOMPUTING, 2024, 609
  • [3] Conditional RGB-T Fusion for Effective Crowd Counting
    Pahwa, Esha
    Kapadia, Sanjeet
    Luthra, Achleshwar
    Sheeranali, Shreyas
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022: 376-380
  • [4] CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting
    Cai, Jialu
    Wang, Qing
    Jiang, Shengqin
    [J]. BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463: 427-436
  • [5] DAACFNet: Discriminative Activation and Adjacent Context Fusion Network for RGB-T Crowd Counting
    Xie, Zhengxuan
    Shao, Feng
    Mu, Baoyang
    Chen, Hangwei
    [J]. SSRN, 2024
  • [6] DEFNet: Dual-Branch Enhanced Feature Fusion Network for RGB-T Crowd Counting
    Zhou, Wujie
    Pan, Yi
    Lei, Jingsheng
    Ye, Lv
    Yu, Lu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12): 24540-24549
  • [7] CAGNet: Coordinated attention guidance network for RGB-T crowd counting
    Yang, Xun
    Zhou, Wujie
    Yan, Weiqing
    Qian, Xiaohong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
  • [8] BGDFNet: Bidirectional Gated and Dynamic Fusion Network for RGB-T Crowd Counting in Smart City System
    Xie, Zhengxuan
    Shao, Feng
    Mu, Baoyang
    Chen, Hangwei
    Jiang, Qiuping
    Lu, Chenyang
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [9] A three-stream fusion and self-differential attention network for multi-modal crowd counting
    Tang, Haihan
    Wang, Yi
    Lin, Zhiping
    Chau, Lap-Pui
    Zhuang, Huiping
    [J]. PATTERN RECOGNITION LETTERS, 2024, 183: 35-41
  • [10] A unified RGB-T crowd counting learning framework
    Gu, Siqi
    Lian, Zhichao
    [J]. IMAGE AND VISION COMPUTING, 2023, 131