Real-time pneumonia prediction using pipelined spark and high-performance computing

被引:0
|
作者
Ravikumar A. [1 ]
Sriraman H. [1 ]
机构
[1] School of Computer Science and Engineering, Vellore Institute of Technology, Tamil Nadu, Chennai
关键词
Convolutional neural network; Data parallel model; Distributed deep learning; High performance computing; Parameter server; Pneumonia; Prediction model; Spark;
D O I
10.7717/PEERJ-CS.1258
中图分类号
学科分类号
摘要
Background: Pneumonia is a respiratory disease caused by bacteria; it affects many people, particularly in impoverished countries where pollution, unclean living standards, overpopulation, and insufficient medical infrastructures are prevalent. To guarantee curative therapy and boost survival chances, it is vital to detect pneumonia soon enough. Imaging using chest X-rays is the most common way of detecting pneumonia. However, analyzing chest X-rays is a complex process vulnerable to subjective variation. Moreover, the data available is growing exponentially, and it will take hours and days to train the model to predict pneumonia. Timely prediction is significant to guarantee a better cure and treatment. Existing work provided by different authors needs more precision, and the computation time for predicting pneumonia is also much longer. Therefore, there is a requirement for early forecasting. Using X-ray picture samples, the system must have a continuous and unsupervised learning system for early diagnosis. Methods: In this article, the training time of the model is accelerated using the distributed data-parallel approach and the computational power of high-performance computing devices. This research aims to diagnose pneumonia using X-ray pictures with more precision, greater speed, and fewer processing resources. Distributed deep learning techniques are gaining popularity owing to the rising need for computational resources for deep learning models with several parameters. In contrast to conventional training methods, data-parallel training enables several compute nodes to train massive deep-learning models to improve training efficiency concurrently. Deploying the model in Spark solves the scalability and acceleration. Spark's distributed processing capability reads data from multiple nodes, and the results demonstrate that training time can be drastically reduced by utilizing these techniques, which is a significant necessity when dealing with large datasets. Results: The proposed model makes the prediction 1.5 times faster than the traditional CNN model used for pneumonia prediction. The model also achieved an accuracy of 98.72%. The speed-up varying from 1.2 to 1.5 was obtained in the synchronous and asynchronous parallel model. The speed-up is reduced in the parallel asynchronous model due to the presence of straggler nodes. © 2023.
引用
收藏
页码:1 / 23
页数:22
相关论文
共 50 条
  • [21] A comparative analysis of resource allocation schemes for real-time services in high-performance computing systems
    Qureshi, Muhammad Shuaib
    Qureshi, Muhammad Bilal
    Fayaz, Muhammad
    Mashwani, Wali Khan
    Belhaouari, Samir Brahim
    Hassan, Saima
    Shah, Asadullah
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2020, 16 (08)
  • [22] HRHS: A High-Performance Real-Time Hardware Scheduler
    Derafshi, Danesh
    Norollah, Amin
    Khosroanjam, Mohsen
    Beitollahi, Hakem
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (04) : 897 - 908
  • [23] A NEW SERIES OF HIGH-PERFORMANCE REAL-TIME COMPUTERS
    ALLAN, ME
    SCHOENDORF, N
    CHATTERTON, CB
    CROSS, DM
    HEWLETT-PACKARD JOURNAL, 1984, 35 (02): : 3 - 6
  • [24] A SCHEME FOR HIGH-PERFORMANCE REAL-TIME BER MEASUREMENT
    SCHOLZ, JB
    COOK, SC
    GILES, TC
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1992, 40 (10) : 1574 - 1576
  • [25] A High-Performance Index for Real-Time Matrix Retrieval
    Wen, Zeyi
    Liang, Mingyu
    He, Bingsheng
    Xia, Zexin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (07) : 3044 - 3056
  • [26] A high-performance processor for embedded real-time control
    Cumplido, R
    Jones, S
    Goodall, RM
    Bateman, S
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2005, 13 (03) : 485 - 492
  • [27] High-Performance Siamese Network for Real-Time Tracking
    Du, Guocai
    Zhou, Peiyong
    Abudurexiti, Ruxianguli
    Mahpirat
    Aysa, Alimjan
    Ubul, Kurban
    SENSORS, 2022, 22 (22)
  • [28] Energy Efficient Real-Time Tasks Scheduling on High-Performance Edge-Computing Systems Using Genetic Algorithm
    Hussain, Hameed
    Zakarya, Muhammad
    Ali, Ahmad
    Khan, Ayaz Ali
    Qazani, Mohammad Reza Chalak
    Al-Bahri, Mahmood
    Haleem, Muhammad
    IEEE ACCESS, 2024, 12 : 54879 - 54892
  • [29] High-performance real-time implementation of a spectral estimator
    Madeira, MM
    Beltran, LAA
    Gonzalez, JS
    Nocetti, FG
    Tokhi, MO
    Ruano, MG
    ALGORITHMS AND ARCHITECTURES FOR REAL-TIME CONTROL 1998 (AARTC'98), 1998, : 185 - 189
  • [30] High-Performance and Real-Time Volume Rendering in CUDA
    Zhao, Yue
    Cui, Xiaoyu
    Cheng, Ying
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 225 - 228