Real-time pneumonia prediction using pipelined spark and high-performance computing

被引:0
|
作者
Ravikumar A. [1 ]
Sriraman H. [1 ]
机构
[1] School of Computer Science and Engineering, Vellore Institute of Technology, Tamil Nadu, Chennai
关键词
Convolutional neural network; Data parallel model; Distributed deep learning; High performance computing; Parameter server; Pneumonia; Prediction model; Spark;
D O I
10.7717/PEERJ-CS.1258
中图分类号
学科分类号
摘要
Background: Pneumonia is a respiratory disease caused by bacteria; it affects many people, particularly in impoverished countries where pollution, unclean living standards, overpopulation, and insufficient medical infrastructures are prevalent. To guarantee curative therapy and boost survival chances, it is vital to detect pneumonia soon enough. Imaging using chest X-rays is the most common way of detecting pneumonia. However, analyzing chest X-rays is a complex process vulnerable to subjective variation. Moreover, the data available is growing exponentially, and it will take hours and days to train the model to predict pneumonia. Timely prediction is significant to guarantee a better cure and treatment. Existing work provided by different authors needs more precision, and the computation time for predicting pneumonia is also much longer. Therefore, there is a requirement for early forecasting. Using X-ray picture samples, the system must have a continuous and unsupervised learning system for early diagnosis. Methods: In this article, the training time of the model is accelerated using the distributed data-parallel approach and the computational power of high-performance computing devices. This research aims to diagnose pneumonia using X-ray pictures with more precision, greater speed, and fewer processing resources. Distributed deep learning techniques are gaining popularity owing to the rising need for computational resources for deep learning models with several parameters. In contrast to conventional training methods, data-parallel training enables several compute nodes to train massive deep-learning models to improve training efficiency concurrently. Deploying the model in Spark solves the scalability and acceleration. Spark's distributed processing capability reads data from multiple nodes, and the results demonstrate that training time can be drastically reduced by utilizing these techniques, which is a significant necessity when dealing with large datasets. Results: The proposed model makes the prediction 1.5 times faster than the traditional CNN model used for pneumonia prediction. The model also achieved an accuracy of 98.72%. The speed-up varying from 1.2 to 1.5 was obtained in the synchronous and asynchronous parallel model. The speed-up is reduced in the parallel asynchronous model due to the presence of straggler nodes. © 2023.
引用
收藏
页码:1 / 23
页数:22
相关论文
共 50 条
  • [1] Real-time pneumonia prediction using pipelined spark and high-performance computing
    Ravikumar, Aswathy
    Sriraman, Harini
    PEERJ COMPUTER SCIENCE, 2023, 9
  • [2] High-performance scalable computing for real-time applications
    Boggess, T
    Shirley, F
    SIXTH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, PROCEEDINGS, 1997, : 332 - 335
  • [3] High-performance computing in real-time ultrasonic imaging
    Nocetti, DFG
    González, JS
    Casique, MFV
    Ramirez, RO
    Hernández, EM
    ACOUSTICAL IMAGING, VOL 24, 2000, 24 : 113 - 120
  • [4] High-performance computing for real-time spectral estimation
    Madeira, MM
    Bellis, SJ
    Beltran, LAA
    González, JS
    Nocetti, DFG
    Marnane, WP
    Tokhi, MO
    Ruano, MG
    CONTROL ENGINEERING PRACTICE, 1999, 7 (05) : 679 - 686
  • [5] Timing Predictability in High-Performance Computing With Probabilistic Real-Time
    Reghenzani, Federico
    Massari, Giuseppe
    Fornaciari, William
    IEEE ACCESS, 2020, 8 (08): : 208566 - 208582
  • [6] High-performance computing nodes for real-time parallel applications
    Carden, TC
    Dobinson, RW
    Fisher, S
    Maley, PD
    NUCLEAR INSTRUMENTS & METHODS IN PHYSICS RESEARCH SECTION A-ACCELERATORS SPECTROMETERS DETECTORS AND ASSOCIATED EQUIPMENT, 1997, 394 (1-2): : 211 - 218
  • [7] REAL-TIME PROCESSING - A GROWING DOMAIN OF HIGH-PERFORMANCE COMPUTING
    MALINOWSKI, CW
    ELECTRONIC ENGINEERING, 1989, 61 (748): : 55 - &
  • [8] Elastic High-performance Computing Platform for Real-time Data Analysis
    Simchev, T.
    APPLICATION OF MATHEMATICS IN TECHNICAL AND NATURAL SCIENCES (AMITANS'18), 2018, 2025
  • [9] HiperView: real-time monitoring of dynamic behaviors of high-performance computing centers
    Tommy Dang
    Ngan Nguyen
    Yong Chen
    The Journal of Supercomputing, 2021, 77 : 11807 - 11826
  • [10] Gisola: A High-Performance Computing Application for Real-Time Moment Tensor Inversion
    Triantafyllis, Nikolaos
    Venetis, Ioannis E.
    Fountoulakis, Ioannis
    Pikoulis, Erion-Vasilis
    Sokos, Efthimios
    Evangelidis, Christos P.
    SEISMOLOGICAL RESEARCH LETTERS, 2022, 93 (2A) : 957 - 966