Spatio-temporal context based recurrent visual attention model for lymph node detection

被引:2
|
作者
Peng, Haixin [1 ]
Peng, Yinjun [1 ,2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Shandong Prov Key Lab Wisdom Min Informat Technol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
biomedical image classification; false-positive reduction; mixture density networks; recurrent visual attention; CONVOLUTIONAL NEURAL-NETWORKS; AUTOMATIC DETECTION; SEGMENTATION; CNN;
D O I
10.1002/ima.22430
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
False-positive reduction is one of the most crucial components in an automated lymph nodes (LNs) detection task in volumetric computed tomography (CT) scans, which is a highly sought goal for cancer diagnosis and early treatment. In this article, treating the three-dimensional (3D) LN detection task as object detection on sequence problem, we propose a novel spatio-temporal context-based recurrent visual attention model (STRAM) for the LNs false positive reduction. We firstly extract the deep spatial features maps for two-dimensional LN patches from pre-trained Inception-V3 model. A new Gaussian kernel-based spatial attention method is then presented to extract the most discriminating spatial features for the corresponding center slices. Additionally, to combine the temporal information between 3D CT slices, we devise a novel "Siamese" mixture density networks which can learn to adaptively focus on the most relevant parts of the CT slices. Considering the lesion areas always locate around the centroid of the 3D CT scans, a hard constraint is imposed on the predicted attention locations with batch normalization technique and the Siamese architecture. The proposed model is a fully differentiable unit that can be optimized end-to-end by using stochastic gradient descent. The effectiveness of our method is verified on LN dataset: 388 mediastinal LNs labeled by radiologists in 90 patient CT scans, and 595 abdominal LNs in 86 patient CT scans. Our method demonstrates sensitivities of about 87%/82% at 3 FP/vol. and 93%/89% at 6 FP/vol. for mediastinum and abdomen, respectively, which compares favorably to previous methods.
引用
收藏
页码:1220 / 1242
页数:23
相关论文
共 50 条
  • [41] Multi-scale spatio-temporal context visual tracking algorithm based on target model adaptive update
    Chen, Faling
    Ding, Qinghai
    Luo, Haibo
    Hui, Bin
    Chang, Zheng
    Liu, Yunpeng
    AOPC 2020: OPTICAL SENSING AND IMAGING TECHNOLOGY, 2020, 11567
  • [42] Human Action Recognition Algorithm Based on Spatio-Temporal Interactive Attention Model
    Pan Na
    Jiang Min
    Kong Jun
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (18)
  • [43] Spatio-Temporal Video Denoising Based on Attention Mechanism
    Ji, Kai
    Lei, Weimin
    Zhang, Wei
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 37 (06)
  • [44] Spatio-temporal analysis of feature-based attention
    Schoenfeld, M. A.
    Hopf, J.-M.
    Martinez, A.
    Mai, H. M.
    Sattler, C.
    Gasde, A.
    Heinze, H.-J.
    Hillyard, S. A.
    CEREBRAL CORTEX, 2007, 17 (10) : 2468 - 2477
  • [45] Spatio-Temporal Attention Model for Foreground Detection in Cross-Scene Surveillance Videos
    Liang, Dong
    Pan, Jiaxing
    Sun, Han
    Zhou, Huiyu
    SENSORS, 2019, 19 (23)
  • [46] Parallel implementation of a spatio-temporal visual saliency model
    Rahman, A.
    Houzet, D.
    Pellerin, D.
    Marat, S.
    Guyader, N.
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2011, 6 (01) : 3 - 14
  • [47] Attention Guided Spatio-Temporal Artifacts Extraction for Deepfake Detection
    Wang, Zhibing
    Li, Xin
    Ni, Rongrong
    Zhao, Yao
    PATTERN RECOGNITION AND COMPUTER VISION, PT IV, 2021, 13022 : 374 - 386
  • [48] Parallel implementation of a spatio-temporal visual saliency model
    A. Rahman
    D. Houzet
    D. Pellerin
    S. Marat
    N. Guyader
    Journal of Real-Time Image Processing, 2011, 6 : 3 - 14
  • [49] Spatio-temporal Quality Pooling Adaptive to Distortion Distribution and Visual Attention
    Li, Yichen
    Guo, Xiaoqiang
    Wang, Haiying
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [50] Action Recognition With Spatio-Temporal Visual Attention on Skeleton Image Sequences
    Yang, Zhengyuan
    Li, Yuncheng
    Yang, Jianchao
    Luo, Jiebo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (08) : 2405 - 2415