SCAF-Net: Scene Context Attention-Based Fusion Network for Vehicle Detection in Aerial Imagery

被引:5
|
作者
Wang, Minghui [1 ]
Li, Qingpeng [2 ,3 ]
Gu, Yunchao [1 ]
Fang, Leyuan [2 ,3 ]
Zhu, Xiao Xiang [4 ,5 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Virtual Real Technol & Syst, Beijing 100191, Peoples R China
[2] Hunan Univ, Sch Robot, State Lab Robot Visual Percept & Control Technol, Changsha 410082, Hunan, Peoples R China
[3] Hunan Univ, Coll Elect & Engn, Changsha 410082, Hunan, Peoples R China
[4] German Aerosp Ctr, Remote Sensing Technol Inst, D-82234 Wessling, Germany
[5] Tech Univ Munich, Signal Proc Earth Observat, D-80333 Munich, Germany
基金
中国国家自然科学基金;
关键词
Vehicle detection; Feature extraction; Task analysis; Remote sensing; Computer architecture; Training; Testing; Attention-based model; deep learning; fusion network; remote sensing; vehicle detection;
D O I
10.1109/LGRS.2021.3107281
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
In recent years, deep learning methods have achieved great success for vehicle detection tasks in aerial imagery. However, most existing methods focus only on extracting latent vehicle target features, and rarely consider the scene context as vital prior knowledge. In this letter, we propose a scene context attention-based fusion network (SCAF-Net), to fuse the scene context of vehicles into an end-to-end vehicle detection network. First, we propose a novel strategy, patch cover, to keep the original target and scene context information in raw aerial images of a large scale as much as possible. Next, we use an improved YOLO-v3 network as one branch of SCAF-Net, to generate vehicle candidates on each patch. Here, a novel branch for the scene context is utilized to extract the latent scene context of vehicles on each patch without any extra annotations. Then, these two branches above are concatenated together as a fusion network, and we apply an attention-based model to further extract vehicle candidates of each local scene. Finally, all vehicle candidates of different patches, are merged by global nonmax suppress (g-NMS) to output the detection result of the whole original image. Experimental results demonstrate that our proposed method outperforms the comparison methods with both high detection accuracy and speed. Our code is released at https://github.com/minghuicode/SCAF-Net.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Attention-Based Scene Text Detection on Dual Feature Fusion
    Li, Yuze
    Silamu, Wushour
    Wang, Zhenchao
    Xu, Miaomiao
    [J]. SENSORS, 2022, 22 (23)
  • [2] Attention-Based Context Aware Network for Semantic Comprehension of Aerial Scenery
    Shi, Weipeng
    Qin, Wenhu
    Yun, Zhonghua
    Ping, Peng
    Wu, Kaiyang
    Qu, Yuke
    [J]. SENSORS, 2021, 21 (06) : 1 - 23
  • [3] Cloud Detection for Satellite Imagery Using Attention-Based U-Net Convolutional Neural Network
    Guo, Yanan
    Cao, Xiaoqun
    Liu, Bainian
    Gao, Mei
    [J]. SYMMETRY-BASEL, 2020, 12 (06):
  • [4] Attention-Based Multi-Modal Fusion Network for Semantic Scene Completion
    Li, Siqi
    Zou, Changqing
    Li, Yipeng
    Zhao, Xibin
    Gao, Yue
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11402 - 11409
  • [5] Attention-based acoustic feature fusion network for depression detection
    Xu, Xiao
    Wang, Yang
    Wei, Xinru
    Wang, Fei
    Zhang, Xizhe
    [J]. NEUROCOMPUTING, 2024, 601
  • [6] MFCD-Net: Cross Attention Based Multimodal Fusion Network for DPC Imagery Cloud Detection
    Zhang, Jingjing
    Ge, Kai
    Xun, Lina
    Sun, Xiaobing
    Xiong, Wei
    Zou, Mingmin
    Zhong, Jinqin
    Li, Teng
    [J]. REMOTE SENSING, 2022, 14 (16)
  • [7] RADC-Net: A residual attention based convolution network for aerial scene classification
    Bi, Qi
    Qin, Kun
    Zhang, Han
    Li, Zhili
    Xu, Kai
    [J]. NEUROCOMPUTING, 2020, 377 : 345 - 359
  • [8] Scene Text Detection via Deep Semantic Feature Fusion and Attention-based Refinement
    Song, Yu
    Cui, Yuanshun
    Han, Hu
    Shan, Shiguang
    Chen, Xilin
    [J]. 2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 3747 - 3752
  • [9] AMFF-Net: An attention-based multi-scale feature fusion network for allergic pollen detection
    Li, Jianqiang
    Wang, Quanzeng
    Xiong, Chengyao
    Zhao, Linna
    Cheng, Wenxiu
    Xu, Xi
    [J]. Expert Systems with Applications, 2024, 235
  • [10] An Attention-Based Context Fusion Network for Spatiotemporal Prediction of Sea Surface Temperature
    Shi, Benyun
    Hao, Yingjian
    Feng, Liu
    Ge, Conghui
    Peng, Yue
    He, Hailun
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21