Occluded prohibited object detection in X-ray images with global Context-aware Multi-Scale feature Aggregation

被引:13
|
作者
Ma, Chunjie [1 ,2 ]
Zhuo, Li [1 ,2 ]
Li, Jiafeng [1 ,2 ]
Zhang, Yutong [1 ,2 ]
Zhang, Jing [1 ,2 ]
机构
[1] Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intelligen, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Fac Informat, Beijing 100124, Peoples R China
基金
北京市自然科学基金;
关键词
X-ray Image; Occluded Prohibited Object Detection; Gabor Convolution; Global Context Feature Extraction; Dual Scale Feature Aggregation;
D O I
10.1016/j.neucom.2022.11.034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Prohibited Object Detection (POD) in X-ray images plays an important role in protecting public safety. Automatic and accurate POD is required to relieve the working pressure of security inspectors. However, the existing methods cannot obtain a satisfactory detection accuracy, and especially, the prob-lem of object occlusion also has not been solved well. Therefore, in this paper, according to the specific characteristics of X-ray images as well as low-level and high-level features of Convolutional Neural Network (CNN), different feature enhancement strategies have been elaborately designed for occluded POD. First, a learnable Gabor convolutional layer is designed and embedded into the low layer of the net-work to enhance the network's capability to capture the edge and contour information of object. A Spatial Attention (SA) mechanism is then designed to weight the output features of the Gabor convolutional layer to enhance the spatial structure information of object and suppress the background noises simul-taneously. For the high-level features, Global Context Feature Extraction (GCFE) module is proposed to extract multi-scale global contextual information of object. And, a Dual Scale Feature Aggregation (DSFA) module is proposed to fuse these global features with those of another layer. To verify the effec-tiveness of the proposed modules, they are embedded into typical one-stage and two-stage object detec-tion frameworks, i.e., Faster R-CNN and YOLO v5L, obtaining POD-F and POD-Y methods, respectively. The proposed methods have been extensively evaluated on three publicly available benchmark datasets, namely SIXray, OPIXray and WIXray. The experimental results show that, compared with existing meth-ods, the proposed POD-Y method can achieve a state-of-the-art detection accuracy. And POD-F can also achieve a competitive detection performance among the two-stage detection methods.1 (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [31] Feature Enhancement for Multi-scale Object Detection
    Zheng, Huicheng
    Chen, Jiajie
    Chen, Lvran
    Li, Ye
    Yan, Zhiwei
    [J]. NEURAL PROCESSING LETTERS, 2020, 51 (02) : 1907 - 1919
  • [32] Feature Enhancement for Multi-scale Object Detection
    Huicheng Zheng
    Jiajie Chen
    Lvran Chen
    Ye Li
    Zhiwei Yan
    [J]. Neural Processing Letters, 2020, 51 : 1907 - 1919
  • [33] LGCNet: A local-to-global context-aware feature augmentation network for salient object detection
    Ji, Yuzhu
    Zhang, Haijun
    Gao, Feng
    Sun, Haofei
    Wei, Haokun
    Wang, Nan
    Yang, Biao
    [J]. INFORMATION SCIENCES, 2022, 584 : 399 - 416
  • [34] Fine-YOLO: A Simplified X-ray Prohibited Object Detection Network Based on Feature Aggregation and Normalized Wasserstein Distance
    Zhou, Yu-Tong
    Cao, Kai-Yang
    Li, De
    Piao, Jin-Chun
    [J]. SENSORS, 2024, 24 (11)
  • [35] MSCAF-Net: A General Framework for Camouflaged Object Detection via Learning Multi-Scale Context-Aware Features
    Liu, Yu
    Li, Haihang
    Cheng, Juan
    Chen, Xun
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (09) : 4934 - 4947
  • [36] COVID Detection From Chest X-Ray Images Using Multi-Scale Attention
    Dhere, Abhinav
    Sivaswamy, Jayanthi
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2022, 26 (04) : 1496 - 1505
  • [37] Multi-scale object detection in UAV images based on adaptive feature fusion
    Tan, Siqi
    Duan, Zhijian
    Pu, Longzhong
    [J]. PLOS ONE, 2024, 19 (03):
  • [38] CTA-FPN: Channel-Target Attention Feature Pyramid Network for Prohibited Object Detection in X-ray Images
    Zhang, Yi
    Zhuo, Li
    Ma, Chunjie
    Zhang, Yutong
    Li, Jiafeng
    [J]. SENSING AND IMAGING, 2023, 24 (01):
  • [39] CTA-FPN: Channel-Target Attention Feature Pyramid Network for Prohibited Object Detection in X-ray Images
    Yi Zhang
    Li Zhuo
    Chunjie Ma
    Yutong Zhang
    Jiafeng Li
    [J]. Sensing and Imaging, 24
  • [40] Global and Local Multi-scale Feature Fusion for Object Detection and Semantic Segmentation
    Lim, Young-Chul
    Kang, Minsung
    [J]. 2019 30TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV19), 2019, : 2557 - 2562