SliceNets - A Scalable Approach for Object Detection in 3D CT Scans

Cited by: 7
Authors
Yang, Anqi [1]
Pan, Feng [2]
Saragadam, Vishwanath [1]
Duy Dao [2]
Hui, Zhuo [1]
Chang, Jen-Hao Rick [1]
Sankaranarayanan, Aswin C. [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[2] IDSS Corp, Boxboro, MA USA
DOI
10.1109/WACV48630.2021.00038
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
One of the most promising approaches for automated detection of guns and other prohibited items in aviation baggage screening is the use of 3D computed tomography (CT) scans. However, automated detection, especially with deep neural networks, faces two key challenges: the high dimensionality of individual 3D scans, and the lack of labelled training data. We address these challenges using a novel image-based detection and segmentation technique that we call the slice-and-fuse framework. Our approach relies on slicing the input 3D volumes, generating 2D predictions on each slice using 2D Convolutional Neural Networks (CNNs), and fusing them to obtain a 3D prediction. We develop two distinct detectors based on this slice-and-fuse strategy: the Retinal-SliceNet, which uses a unified, single network with end-to-end training, and the U-SliceNet, which uses a two-stage paradigm, first generating proposals using a voxel labeling network and subsequently refining the proposals with a 3D classification network. The networks are trained using a data augmentation approach that creates a very large training dataset by inserting weapons into 3D CT scans of threat-free bags. We demonstrate that the two SliceNets outperform state-of-the-art methods on a large-scale 3D baggage CT dataset for baggage classification, 3D object detection, and 3D semantic segmentation.
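The slice-and-fuse idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say along which axes the volume is sliced or how the 2D predictions are fused, so the sketch assumes slicing along all three axes and fusion by simple averaging. The function `predict_slice` is a hypothetical stand-in (a plain threshold) for a trained 2D CNN, and the name `slice_and_fuse` is likewise an illustrative choice.

```python
import numpy as np


def predict_slice(slice_2d):
    """Hypothetical stand-in for a 2D CNN: returns a per-pixel score in [0, 1].

    Here it is just an intensity threshold, used only to keep the sketch runnable.
    """
    return (slice_2d > 0.5).astype(np.float32)


def slice_and_fuse(volume, predict=predict_slice):
    """Slice a 3D volume along each axis, score every 2D slice, and fuse.

    Assumption: fusion is the mean of the three per-axis score volumes.
    """
    fused = np.zeros(volume.shape, dtype=np.float32)
    for axis in range(3):
        # Bring the slicing axis to the front so iteration yields 2D slices.
        slices = np.moveaxis(volume, axis, 0)
        scores = np.stack([predict(s) for s in slices], axis=0)
        # Restore the original axis order before accumulating.
        fused += np.moveaxis(scores, 0, axis)
    return fused / 3.0


# Toy volume with one bright 3x3x3 block standing in for an object.
volume = np.zeros((8, 8, 8), dtype=np.float32)
volume[2:5, 3:6, 1:4] = 1.0

scores = slice_and_fuse(volume)
mask = scores > 0.5  # voxel-level prediction after fusion
```

Because each 2D prediction only ever sees one slice, the per-slice model stays a standard 2D network regardless of volume size; the 3D context comes from combining predictions across slicing directions.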
Pages: 335-344
Page count: 10