Attention-Based Multimodal Image Feature Fusion Module for Transmission Line Detection

被引:27
|
作者
Choi, Hyeyeon [1 ]
Yun, Jong Pil [2 ,3 ]
Kim, Bum Jun [1 ]
Jang, Hyeonah [1 ]
Kim, Sang Woo [1 ,4 ]
机构
[1] Pohang Univ Sci & Technol, Dept Elect Engn, Pohang 37673, South Korea
[2] Korea Inst Ind Technol KITECH, Cheonan 31056, South Korea
[3] Univ Sci & Technol, KITECH Sch, Daejeon 34113, South Korea
[4] Yonsei Univ, Inst Convergence Res & Edu Adv Technol, Seoul 03021, South Korea
关键词
Channelwise attention (CA); deep learning; feature fusion module (FFM); multibranch; transmission line detection (TLD); POWER; NETWORK; CLASSIFICATION; INSPECTION; NEST;
D O I
10.1109/TII.2022.3147833
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transmission line (TL) inspection is important for ensuring a stable supply of electricity to rural areas. Currently, there are several TL detection approaches based on computer vision; however, they have limitations owing to background clutter in visible light images. This article presents a novel multimodal image feature fusion module that utilizes both visible light and infrared images to enhance the TL-detection performance. The proposed module consists of a multibranch feature extraction (MFE) block followed by a channelwise attention (CA) block. The first block extracts the representative features of each modal input using multiple branches. The outputs of the MFE block are jointly aggregated into an attention vector in the CA block. Finally, the attention vector recalibrates each input feature of the proposed module. To reduce the number of additional parameters due to the insertion of the module, we introduced a channel-shrink factor in the MFE block and utilized a 1 x 1 convolution in the CA block. Comparison experiments with various augmented conditions of day, night, fog, and snow were conducted on a real-world dataset, which we constructed by visible light and infrared images. The results showed that the proposed module outperformed not only the case of single modal input but also the state-of-the-art fusion methods, regardless of the baseline networks. Additionally, the proposed module showed effectiveness in terms of capacity when the baseline network has a large number of weight parameters.
引用
收藏
页码:7686 / 7695
页数:10
相关论文
共 50 条
  • [1] Infrared and Visible Image Fusion via Attention-Based Adaptive Feature Fusion
    Wang, Lei
    Hu, Ziming
    Kong, Quan
    Qi, Qian
    Liao, Qing
    [J]. ENTROPY, 2023, 25 (03)
  • [2] Attention-Based Scene Text Detection on Dual Feature Fusion
    Li, Yuze
    Silamu, Wushour
    Wang, Zhenchao
    Xu, Miaomiao
    [J]. SENSORS, 2022, 22 (23)
  • [3] Attention-based multimodal image matching
    Moreshet, Aviad
    Keller, Yosi
    [J]. COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 241
  • [4] Attention-based acoustic feature fusion network for depression detection
    Xu, Xiao
    Wang, Yang
    Wei, Xinru
    Wang, Fei
    Zhang, Xizhe
    [J]. NEUROCOMPUTING, 2024, 601
  • [5] Semantic attention-based heterogeneous feature aggregation network for image fusion
    Ruan, Zhiqiang
    Wan, Jie
    Xiao, Guobao
    Tang, Zhimin
    Ma, Jiayi
    [J]. PATTERN RECOGNITION, 2024, 155
  • [6] Attention-Based Multimodal Fusion for Video Description
    Hori, Chiori
    Hori, Takaaki
    Lee, Teng-Yok
    Zhang, Ziming
    Harsham, Bret
    Hershey, John R.
    Marks, Tim K.
    Sumi, Kazuhiko
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4203 - 4212
  • [7] Attention-Based Multiscale Feature Fusion for Efficient Surface Defect Detection
    Zhao, Yuhao
    Liu, Qing
    Su, Hu
    Zhang, Jiabin
    Ma, Hongxuan
    Zou, Wei
    Liu, Song
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 10
  • [8] A feature fusion module based on complementary attention for medical image segmentation
    Yang, Mingyue
    Dong, Xiaoxuan
    Zhang, Wang
    Xie, Peng
    Li, Chuan
    Chen, Shanxiong
    [J]. DISPLAYS, 2024, 84
  • [9] Detection of Atrial Fibrillation based on Feature Fusion using Attention-based BiLSTM
    Xie, Weifang
    Chen, Cang
    Zhao, Ruijie
    Lu, Yu
    [J]. 2023 45TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY, EMBC, 2023,
  • [10] Hierarchical attention-based multimodal fusion for video captioning
    Wu, Chunlei
    Wei, Yiwei
    Chu, Xiaoliang
    Weichen, Sun
    Su, Fei
    Wang, Leiquan
    [J]. NEUROCOMPUTING, 2018, 315 : 362 - 370