TimeDistributed-CNN-LSTM: A Hybrid Approach Combining CNN and LSTM to Classify Brain Tumor on 3D MRI Scans Performing Ablation Study

Cited by: 60
Authors
Montaha, Sidratul [1]
Azam, Sami [2]
Rafid, A. K. M. Rakibul Haque [1]
Hasan, Md. Zahid [1]
Karim, Asif [2]
Islam, Ashraful [3]
Affiliations
[1] Daffodil Int Univ, Dept Comp Sci & Engn, Dhaka 1207, Bangladesh
[2] Charles Darwin Univ, Coll Engn IT & Environm, Casuarina, NT 0909, Australia
[3] Univ Louisiana Lafayette, Sch Comp & Informat, Lafayette, LA 70504 USA
Keywords
Magnetic resonance imaging; Tumors; Three-dimensional displays; Solid modeling; Brain modeling; Cancer; Imaging; Deep learning; brain tumor classification; 3D MRI; hybrid CNN LSTM; 3D CNN; ablation study; CENTRAL-NERVOUS-SYSTEM; CLASSIFICATION
DOI
10.1109/ACCESS.2022.3179577
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Identifying brain tumors at an early stage is crucial in cancer diagnosis, as a timely diagnosis can increase the chances of survival. Given the challenges of tumor biopsies, three-dimensional (3D) Magnetic Resonance Imaging (MRI) scans are extensively used to analyze brain tumors with deep learning. In this study, three BraTS datasets are employed to classify brain tumors into two classes; each dataset contains four 3D MRI sequences per patient. The research comprises two parts. In the first, we propose a hybrid model named TimeDistributed-CNN-LSTM (TD-CNN-LSTM), which combines a 3D Convolutional Neural Network (CNN) with Long Short-Term Memory (LSTM) and wraps each layer in a TimeDistributed function. The objective is to treat all four MRI sequences of a patient as a single input, because every sequence contains necessary information about the tumor. The model's optimal configuration is determined through an ablation study of the layer architecture and hyper-parameters. In the second part, a 3D CNN model is trained separately on each MRI sequence for comparison. The datasets are preprocessed before training to maximize performance. Results demonstrate that the TD-CNN-LSTM network outperforms the 3D CNN, achieving the highest test accuracy of 98.90%. To assess the consistency of its performance, the TD-CNN-LSTM model is further evaluated with K-fold cross-validation. This approach of processing all MRI sequences together, with good generalization capability, can be used in future medical research to aid radiologists in tumor diagnostics.
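The idea of wrapping layers in a TimeDistributed function can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the volume size, the feature width, and the `toy_features` projection (a stand-in for a 3D CNN) are all assumptions chosen only to show how four MRI sequences of one patient can be stacked into a single input and passed through one shared feature extractor.

```python
import numpy as np

def time_distributed(fn, x):
    """Apply the same function `fn` to every step along axis 0 and stack the results."""
    return np.stack([fn(step) for step in x], axis=0)

def toy_features(volume, weights):
    """Hypothetical stand-in for a 3D CNN: flatten the volume and project it with shared weights."""
    return np.tanh(volume.reshape(-1) @ weights)

rng = np.random.default_rng(0)

# Assumed toy sizes; real MRI volumes are far larger.
n_sequences, depth, height, width, n_features = 4, 8, 8, 8, 16

# One patient: four 3D MRI sequences stacked along a leading "time" axis.
patient = rng.normal(size=(n_sequences, depth, height, width))

# A single weight matrix shared by all four sequences, as TimeDistributed implies.
weights = rng.normal(size=(depth * height * width, n_features)) * 0.01

# Result: one feature vector per MRI sequence, i.e. a 4-step sequence.
features = time_distributed(lambda v: toy_features(v, weights), patient)
print(features.shape)  # (4, 16)
```

In a Keras-style model, the same pattern would presumably correspond to `TimeDistributed(Conv3D(...))` layers whose per-sequence outputs are flattened and passed to an `LSTM` layer, which reads the four feature vectors as one sequence before the final classifier.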
Pages: 60039-60059
Number of pages: 21