Deformable Dilated Faster R-CNN for Universal Lesion Detection in CT Images

被引:4
|
作者
Hellmann, Fabio [1 ]
Ren, Zhao [2 ]
Andre, Elisabeth [1 ]
Schuller, Bjoern W. [2 ,3 ]
机构
[1] Univ Augsberg, Chair Human Ctr AI, D-86159 Augsburg, Germany
[2] Univ Augsburg, Chair Embedded Intelligence Hlth Care & Wellbeing, D-86159 Augsburg, Germany
[3] Imperial Coll London, GLAM Grp Language Audio & Mus, London SW7 2AZ, England
关键词
D O I
10.1109/EMBC46164.2021.9631021
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Cancer is a major public health issue and takes the second-highest toll of deaths caused by non-communicable diseases worldwide. Automatically detecting lesions at an early stage is essential to increase the chance of a cure. This study proposes a novel dilated Faster R-CNN with modulated deformable convolution and modulated deformable positive-sensitive region of interest pooling to detect lesions in computer tomography images. A pre-trained VGG-16 is transferred as the backbone of Faster R-CNN, followed by a region proposal network and a region of interest pooling layer to achieve lesion detection. The modulated deformable convolutional layers are employed to learn deformable convolutional filters, while the modulated deformable positive-sensitive region of interest pooling provides an enhanced feature extraction on the feature maps. Moreover, dilated convolutions are combined with the modulated deformable convolutions to fine-tune the VGG-16 model with multi-scale receptive fields. In the experiments evaluated on the DeepLesion dataset, the modulated deformable positive-sensitive region of interest pooling model achieves the highest sensitivity score of 58.8% on average with dilation of [4, 4, 4] and outperforms state-of-the-art models in the range of [2, 8] average false positives per image. This research demonstrates the suitability of dilation modifications and the possibility of enhancing the performance using a modulated deformable positive-sensitive region of interest pooling layer for universal lesion detectors.
引用
收藏
页码:2896 / 2902
页数:7
相关论文
共 50 条
  • [1] Detection and classification of COVID-19 by using faster R-CNN and mask R-CNN on CT images
    M. Emin Sahin
    Hasan Ulutas
    Esra Yuce
    Mustafa Fatih Erkoc
    [J]. Neural Computing and Applications, 2023, 35 : 13597 - 13611
  • [2] Detection and classification of COVID-19 by using faster R-CNN and mask R-CNN on CT images
    Sahin, M. Emin
    Ulutas, Hasan
    Yuce, Esra
    Erkoc, Mustafa Fatih
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (18): : 13597 - 13611
  • [3] Lung lesions detection from CT images based on the modified Faster R-CNN
    Xu, Linlin
    Mao, Xuemin
    Sun, Minmin
    Liu, Wentao
    Wang, Yifan
    Tang, Yuyang
    [J]. PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON COMPUTER, INFORMATION AND TELECOMMUNICATION SYSTEMS (CITS), 2020, : 127 - 131
  • [4] Faster R-CNN for Detection of Carotid Plaque on Ultrasound Images
    An, Xiangjing
    Ye, Guoliang
    Zhou, Xiaoan
    Jiao, Zhibin
    Ding, Shangwei
    Xie, Yanhua
    [J]. 2019 COMPUTING, COMMUNICATIONS AND IOT APPLICATIONS (COMCOMAP), 2019, : 64 - 69
  • [5] A Lightweight Faster R-CNN for Ship Detection in SAR Images
    Li, Yiding
    Zhang, Shunsheng
    Wang, Wen-Qin
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [6] GFD Faster R-CNN: Gabor Fractal DenseNet Faster R-CNN for Automatic Detection of Esophageal Abnormalities in Endoscopic Images
    Ghatwary, Noha
    Zolgharni, Massoud
    Ye, Xujiong
    [J]. MACHINE LEARNING IN MEDICAL IMAGING (MLMI 2019), 2019, 11861 : 89 - 97
  • [7] Driver action recognition using deformable and dilated faster R-CNN with optimized region proposals
    Lu, Mingqi
    Hu, Yaocong
    Lu, Xiaobo
    [J]. APPLIED INTELLIGENCE, 2020, 50 (04) : 1100 - 1111
  • [8] Driver action recognition using deformable and dilated faster R-CNN with optimized region proposals
    Mingqi Lu
    Yaocong Hu
    Xiaobo Lu
    [J]. Applied Intelligence, 2020, 50 : 1100 - 1111
  • [9] Face Detection with the Faster R-CNN
    Jiang, Huaizu
    Learned-Miller, Erik
    [J]. 2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, : 650 - 657
  • [10] IMPROVING FASTER R-CNN WITH DILATED CONVOLUTIONS AND BILINEAR INTERPOLATION FOR TABLE DETECTION
    Kazdar, Takwa
    Mseddi, Wided Souidene
    Jmal, Marwa
    Attia, Rabah
    [J]. MODELLING AND SIMULATION 2021: 35TH ANNUAL EUROPEAN SIMULATION AND MODELLING CONFERENCE 2021 (ESM 2021), 2021, : 179 - 183