Spatio-temporal context based recurrent visual attention model for lymph node detection

被引:2
|
作者
Peng, Haixin [1 ]
Peng, Yinjun [1 ,2 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Shandong Prov Key Lab Wisdom Min Informat Technol, Qingdao, Peoples R China
基金
中国国家自然科学基金;
关键词
biomedical image classification; false-positive reduction; mixture density networks; recurrent visual attention; CONVOLUTIONAL NEURAL-NETWORKS; AUTOMATIC DETECTION; SEGMENTATION; CNN;
D O I
10.1002/ima.22430
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
False-positive reduction is one of the most crucial components in an automated lymph nodes (LNs) detection task in volumetric computed tomography (CT) scans, which is a highly sought goal for cancer diagnosis and early treatment. In this article, treating the three-dimensional (3D) LN detection task as object detection on sequence problem, we propose a novel spatio-temporal context-based recurrent visual attention model (STRAM) for the LNs false positive reduction. We firstly extract the deep spatial features maps for two-dimensional LN patches from pre-trained Inception-V3 model. A new Gaussian kernel-based spatial attention method is then presented to extract the most discriminating spatial features for the corresponding center slices. Additionally, to combine the temporal information between 3D CT slices, we devise a novel "Siamese" mixture density networks which can learn to adaptively focus on the most relevant parts of the CT slices. Considering the lesion areas always locate around the centroid of the 3D CT scans, a hard constraint is imposed on the predicted attention locations with batch normalization technique and the Siamese architecture. The proposed model is a fully differentiable unit that can be optimized end-to-end by using stochastic gradient descent. The effectiveness of our method is verified on LN dataset: 388 mediastinal LNs labeled by radiologists in 90 patient CT scans, and 595 abdominal LNs in 86 patient CT scans. Our method demonstrates sensitivities of about 87%/82% at 3 FP/vol. and 93%/89% at 6 FP/vol. for mediastinum and abdomen, respectively, which compares favorably to previous methods.
引用
收藏
页码:1220 / 1242
页数:23
相关论文
共 50 条
  • [1] A spatio-temporal model of the selective human visual attention
    Le Meur, O
    Thoreau, D
    Le Callet, P
    Barba, D
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3201 - 3204
  • [2] Anomaly detection based on spatio-temporal sparse representation and visual attention analysis
    Wang, Chen
    Yao, Hongxun
    Sun, Xiaoshuai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (05) : 6263 - 6279
  • [3] Anomaly detection based on spatio-temporal sparse representation and visual attention analysis
    Chen Wang
    Hongxun Yao
    Xiaoshuai Sun
    Multimedia Tools and Applications, 2017, 76 : 6263 - 6279
  • [4] LANDSLIDE CHANGE DETECTION BASED ON SPATIO-TEMPORAL CONTEXT
    Huang Qingqing
    Meng Yu
    Chen Jingbo
    Yue Anzhi
    Lin Lei
    2017 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2017, : 1095 - 1098
  • [5] UAV Visual Object Tracking Based on Spatio-Temporal Context
    He, Yongxiang
    Chao, Chuang
    Zhang, Zhao
    Guo, Hongwu
    Ma, Jianjun
    DRONES, 2024, 8 (12)
  • [6] Robust Online Learned Spatio-Temporal Context Model for Visual Tracking
    Wen, Longyin
    Cai, Zhaowei
    Lei, Zhen
    Yi, Dong
    Li, Stan Z.
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (02) : 785 - 796
  • [7] Spatio-temporal Outlier Detection Based on Context: A Summary of Results
    Wang, Zhanquan
    Duan, Chao
    Chen, Defeng
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 428 - 436
  • [8] SASRM: A Semantic and Attention Spatio-temporal Recurrent Model for Next Location Prediction
    Zhang, Xu
    Li, Boming
    Song, Chao
    Huang, Zhengwen
    Li, Yan
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [9] Novel Soft Sensor Model based on Spatio-Temporal Attention
    Hu, Xuan
    Geng, Zhiqiang
    Han, Yongming
    Huang, Wei
    Chen, Kai
    Xie, Feng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [10] The mechanism of visual attention is the spatio-temporal salience map
    Sperling, G
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1996, 31 (3-4) : 3366 - 3366