Light-sensitive and adaptive fusion network for RGB-T crowd counting

被引:1
|
作者
Huang, Liangjun [1 ]
Kang, Wencan [1 ]
Chen, Guangkai [1 ]
Zhang, Qing [1 ]
Zhang, Jianwei [2 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
[2] Univ Hamburg, Dept Informat, D-20354 Hamburg, Germany
来源
VISUAL COMPUTER | 2024年 / 40卷 / 10期
基金
上海市自然科学基金;
关键词
RGB-T image; Crowd counting; Light-sensitive; Cross-modal fusion; PEOPLE; IMAGE;
D O I
10.1007/s00371-024-03388-1
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Mainstream RGB-T crowd counting methods use cross-modal complementary information to improve the counting accuracy. However, most of them neglect the effect of lighting variation on cross-modal data fusion. In this paper, we propose a Light-sensitive and Adaptive Fusion Network (LAFNet) for RGB-T crowd counting. Specifically, we present a Modality-specific Feature Extraction Module (MFEM) that fuses the lighting information, and a Light-sensitive and Adaptive Fusion Module (LAFM) that adjusts the fusion strategies of different modalities according to the lighting conditions of the input crowd images. Moreover, we propose an Improved Multi-scale Extraction Module (IMEM) to extract and fuse multi-modal at different scales. We evaluate our method on the RGBT-CC dataset and the experiment results show the validity of the model and its effectiveness in various scenarios.
引用
收藏
页码:7279 / 7292
页数:14
相关论文
共 50 条
  • [1] Spatial exchanging fusion network for RGB-T crowd counting
    Rao, Chaoqun
    Wan, Lin
    [J]. NEUROCOMPUTING, 2024, 609
  • [2] TAFNet: A Three-Stream Adaptive Fusion Network for RGB-T Crowd Counting
    Tang, Haihan
    Wang, Yi
    Chau, Lap-Pui
    [J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 3299 - 3303
  • [3] CONDITIONAL RGB-T FUSION FOR EFFECTIVE CROWD COUNTING
    Pahwa, Esha
    Kapadia, Sanjeet
    Luthra, Achleshwar
    Sheeranali, Shreyas
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 376 - 380
  • [4] CrowdFusion: Refined Cross-Modal Fusion Network for RGB-T Crowd Counting
    Cai, Jialu
    Wang, Qing
    Jiang, Shengqin
    [J]. BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463 : 427 - 436
  • [5] Daacfnet: Discriminative Activation and Adjacent Context Fusion Network for Rgb-T Crowd Counting
    Xie, Zhengxuan
    Shao, Feng
    Mu, Baoyang
    Chen, Hangwei
    [J]. SSRN, 2024,
  • [6] DEFNet: Dual-Branch Enhanced Feature Fusion Network for RGB-T Crowd Counting
    Zhou, Wujie
    Pan, Yi
    Lei, Jingsheng
    Ye, Lv
    Yu, Lu
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 24540 - 24549
  • [7] CAGNet: Coordinated attention guidance network for RGB-T crowd counting
    Yang, Xun
    Zhou, Wujie
    Yan, Weiqing
    Qian, Xiaohong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
  • [8] BGDFNet: Bidirectional Gated and Dynamic Fusion Network for RGB-T Crowd Counting in Smart City System
    Xie, Zhengxuan
    Shao, Feng
    Mu, Baoyang
    Chen, Hangwei
    Jiang, Qiuping
    Lu, Chenyang
    Ho, Yo-Sung
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
  • [9] A unified RGB-T crowd counting learning framework
    Gu, Siqi
    Lian, Zhichao
    [J]. IMAGE AND VISION COMPUTING, 2023, 131
  • [10] CGINet: Cross-modality grade interaction network for RGB-T crowd counting
    Pan, Yi
    Zhou, Wujie
    Qian, Xiaohong
    Mao, Shanshan
    Yang, Rongwang
    Yu, Lu
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126