Real-time pneumonia prediction using pipelined spark and high-performance computing

被引:0
|
作者
Ravikumar A. [1 ]
Sriraman H. [1 ]
机构
[1] School of Computer Science and Engineering, Vellore Institute of Technology, Tamil Nadu, Chennai
关键词
Convolutional neural network; Data parallel model; Distributed deep learning; High performance computing; Parameter server; Pneumonia; Prediction model; Spark;
D O I
10.7717/PEERJ-CS.1258
中图分类号
学科分类号
摘要
Background: Pneumonia is a respiratory disease caused by bacteria; it affects many people, particularly in impoverished countries where pollution, unclean living standards, overpopulation, and insufficient medical infrastructures are prevalent. To guarantee curative therapy and boost survival chances, it is vital to detect pneumonia soon enough. Imaging using chest X-rays is the most common way of detecting pneumonia. However, analyzing chest X-rays is a complex process vulnerable to subjective variation. Moreover, the data available is growing exponentially, and it will take hours and days to train the model to predict pneumonia. Timely prediction is significant to guarantee a better cure and treatment. Existing work provided by different authors needs more precision, and the computation time for predicting pneumonia is also much longer. Therefore, there is a requirement for early forecasting. Using X-ray picture samples, the system must have a continuous and unsupervised learning system for early diagnosis. Methods: In this article, the training time of the model is accelerated using the distributed data-parallel approach and the computational power of high-performance computing devices. This research aims to diagnose pneumonia using X-ray pictures with more precision, greater speed, and fewer processing resources. Distributed deep learning techniques are gaining popularity owing to the rising need for computational resources for deep learning models with several parameters. In contrast to conventional training methods, data-parallel training enables several compute nodes to train massive deep-learning models to improve training efficiency concurrently. Deploying the model in Spark solves the scalability and acceleration. Spark's distributed processing capability reads data from multiple nodes, and the results demonstrate that training time can be drastically reduced by utilizing these techniques, which is a significant necessity when dealing with large datasets. Results: The proposed model makes the prediction 1.5 times faster than the traditional CNN model used for pneumonia prediction. The model also achieved an accuracy of 98.72%. The speed-up varying from 1.2 to 1.5 was obtained in the synchronous and asynchronous parallel model. The speed-up is reduced in the parallel asynchronous model due to the presence of straggler nodes. © 2023.
引用
收藏
页码:1 / 23
页数:22
相关论文
共 50 条
  • [31] Impact of data dependencies in real-time high performance computing
    Hossain, MA
    Kabir, U
    Tokhi, MO
    MICROPROCESSORS AND MICROSYSTEMS, 2002, 26 (06) : 253 - 261
  • [32] High-Performance Real-Time Human Activity Recognition Using Machine Learning
    Thottempudi, Pardhu
    Acharya, Biswaranjan
    Moreira, Fernando
    MATHEMATICS, 2024, 12 (22)
  • [33] Research on High-Performance Real-time Data Analysis System Based on Spark Streaming in Big Data Environment
    Wang, Jialin
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 124 : 140 - 141
  • [34] Real-time respiratory motion prediction using photonic reservoir computing
    Liang, Zhizhuo
    Zhang, Meng
    Shi, Chengyu
    Huang, Z. Rena
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [35] Real-time respiratory motion prediction using photonic reservoir computing
    Zhizhuo Liang
    Meng Zhang
    Chengyu Shi
    Z. Rena Huang
    Scientific Reports, 13
  • [36] TOTAL HIGH-PERFORMANCE TIME AND DESIGN OF DEGRADABLE REAL-TIME SYSTEMS
    AKATSU, M
    MURATA, T
    KURIHARA, K
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 1994, E77A (03) : 510 - 516
  • [37] A Modified KNN Algorithm for High-Performance Computing on FPGA of Real-Time m-QAM Demodulators
    Marquez-Viloria, David
    Castano-Londono, Luis
    Guerrero-Gonzalez, Neil
    ELECTRONICS, 2021, 10 (05) : 1 - 14
  • [38] Modular composing high-performance real-time rendering software
    Shturtz, IV
    Belyaev, SY
    MULTIMEDIA, HYPERMEDIA AND VIRTUAL REALITY: MODELS, SYSTEMS, AND APPLICATIONS, 1996, 1077 : 130 - 135
  • [39] HLA HIGH-PERFORMANCE AND REAL-TIME SIMULATION STUDIES WITH CERTI
    Chaudron, Jean-Baptiste
    Adelantado, Martin
    Noulard, Eric
    Siron, Pierre
    EUROPEAN SIMULATION AND MODELLING CONFERENCE 2011, 2011, : 69 - +
  • [40] SIGNAL PROCESSOR ARCHITECTURE FOR HIGH-PERFORMANCE REAL-TIME APPLICATIONS
    ISHSHALOM, J
    KAZANZIDES, P
    REAL-TIME SYSTEMS SYMPOSIUM, PROCEEDINGS, 1989, : 184 - 193