High-performance medical data processing technology based on distributed parallel machine learning algorithm

被引:2
|
作者
Liu, Ji [1 ]
Liang, Xiao [2 ]
Ruan, Wenxi [3 ]
Zhang, Bo [4 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2000, Australia
[2] Shanxi VC PE Fund Management Co Ltd, Taiyuan 030000, Shanxi, Peoples R China
[3] Taizhou Vocat Coll Sci & Technol, Taizhou 318020, Peoples R China
[4] China Asset Management Co Ltd, Beijing 100033, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 04期
关键词
Adaptive density peak clustering algorithm; Random forest algorithm; Distributed parallel classification algorithm; Cloud computing; RANDOM FOREST ALGORITHM; SEGMENTATION; IMAGES;
D O I
10.1007/s11227-021-04060-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The aim is to improve the efficiency of medical data processing and establish a sound medical data management system. To apply distributed parallel classification algorithms in the field of hospital intelligent guidance, a Parallel Random Forest (PRF) classification algorithm is proposed based on the Apache Spark cloud computing platform. Given sparse cluster loss in variable density distribution data sets, an Adaptive Domain Density Peak Clustering (ADDPC) method is proposed. Here, a Bilayer Parallel Training-Convolutional Neural Network (BPT-CNN) model based on distributed computing is proposed to detect and classify colon cancer nuclei more accurately through the large-scale parallel deep learning (DL) algorithm. Then, the performance of the proposed model is evaluated through case analysis. The results show that the PRF algorithm based on distributed cloud computing platform can independently design data-parallel tasks, thereby optimizing the data communication cost and efficiency. ADDPC algorithm can adaptively measure domain density and merge sparse clusters to prevent data loss and fragmentation. The BPT-CNN model improves the performance of the algorithm and balances the workload of each task in the algorithm. The results have a significant reference value for solving problems in medical data processing.
引用
收藏
页码:5933 / 5956
页数:24
相关论文
共 50 条
  • [1] High-performance medical data processing technology based on distributed parallel machine learning algorithm
    Ji Liu
    Xiao Liang
    Wenxi Ruan
    Bo Zhang
    The Journal of Supercomputing, 2022, 78 : 5933 - 5956
  • [2] A High-Performance Parallel Approach to Image Processing in Distributed Computing
    Rakhimov, Mekhriddin
    Mamadjanov, Doniyor
    Mukhiddinov, Abulkosim
    2020 IEEE 14TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2020), 2020,
  • [3] Network Support for High-Performance Distributed Machine Learning
    Malandrino, Francesco
    Chiasserini, Carla Fabiana
    Molner, Nuria
    de la Oliva, Antonio
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2023, 31 (01) : 264 - 278
  • [4] Scalable, high-performance data mining with parallel processing
    Freitas, AA
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 1510 : 477 - 477
  • [5] Dynamic Distributed and Parallel Machine Learning algorithms for big data mining processing
    Djafri, Laouni
    DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) : 558 - 601
  • [6] The applications of machine learning techniques in medical data processing based on distributed computing and the Internet of Things
    Aminizadeh, Sarina
    Heidari, Arash
    Toumaj, Shiva
    Darbandi, Mehdi
    Navimipour, Nima Jafari
    Rezaei, Mahsa
    Talebi, Samira
    Azad, Poupak
    Unal, Mehmet
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2023, 241
  • [7] Litz: Elastic Framework for High-Performance Distributed Machine Learning
    Qiao, Aurick
    Aghayev, Abutalib
    Yu, Weiren
    Chen, Haoyang
    Ho, Qirong
    Gibson, Garth A.
    Xing, Eric P.
    PROCEEDINGS OF THE 2018 USENIX ANNUAL TECHNICAL CONFERENCE, 2018, : 631 - 643
  • [8] Keeping up with technology: Teaching parallel, distributed, and high-performance computing
    Prasad, Sushil
    Ghafoor, Sheikh
    Barnas, Martina
    Wolf, Felix
    Saule, Erik
    Rodriguez, Noemi
    Sakellariou, Rizos
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 160 : 36 - 38
  • [9] Keeping up with technology: Teaching Parallel, Distributed and High-Performance Computing
    Prasad, Sushil K.
    Banicescu, Ioana
    Barnas, Martina
    Gimenez, Domingo
    Lumsdaine, Andrew
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2017, 105 : 1 - 3
  • [10] High-performance pediatric surgical risk calculator: A novel algorithm based on machine learning and pediatric NSQIP data
    Bertsimas, Dimitris
    Li, Michael
    Zhang, Nova
    Estrada, Carlos
    Wang, Hsin-Hsiao Scott
    AMERICAN JOURNAL OF SURGERY, 2023, 226 (01): : 115 - 121