High-performance medical data processing technology based on distributed parallel machine learning algorithm

被引:2
|
作者
Liu, Ji [1 ]
Liang, Xiao [2 ]
Ruan, Wenxi [3 ]
Zhang, Bo [4 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2000, Australia
[2] Shanxi VC PE Fund Management Co Ltd, Taiyuan 030000, Shanxi, Peoples R China
[3] Taizhou Vocat Coll Sci & Technol, Taizhou 318020, Peoples R China
[4] China Asset Management Co Ltd, Beijing 100033, Peoples R China
来源
JOURNAL OF SUPERCOMPUTING | 2022年 / 78卷 / 04期
关键词
Adaptive density peak clustering algorithm; Random forest algorithm; Distributed parallel classification algorithm; Cloud computing; RANDOM FOREST ALGORITHM; SEGMENTATION; IMAGES;
D O I
10.1007/s11227-021-04060-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The aim is to improve the efficiency of medical data processing and establish a sound medical data management system. To apply distributed parallel classification algorithms in the field of hospital intelligent guidance, a Parallel Random Forest (PRF) classification algorithm is proposed based on the Apache Spark cloud computing platform. Given sparse cluster loss in variable density distribution data sets, an Adaptive Domain Density Peak Clustering (ADDPC) method is proposed. Here, a Bilayer Parallel Training-Convolutional Neural Network (BPT-CNN) model based on distributed computing is proposed to detect and classify colon cancer nuclei more accurately through the large-scale parallel deep learning (DL) algorithm. Then, the performance of the proposed model is evaluated through case analysis. The results show that the PRF algorithm based on distributed cloud computing platform can independently design data-parallel tasks, thereby optimizing the data communication cost and efficiency. ADDPC algorithm can adaptively measure domain density and merge sparse clusters to prevent data loss and fragmentation. The BPT-CNN model improves the performance of the algorithm and balances the workload of each task in the algorithm. The results have a significant reference value for solving problems in medical data processing.
引用
收藏
页码:5933 / 5956
页数:24
相关论文
共 50 条
  • [31] High-Performance Concrete Strength Prediction Based on Machine Learning
    Liu, Yanning
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [32] High-Performance Concrete Strength Prediction Based on Machine Learning
    Liu, Yanning
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [33] High-performance commercial data mining: A multistrategy machine learning application
    Hsu, WH
    Welge, M
    Redman, T
    Clutter, D
    DATA MINING AND KNOWLEDGE DISCOVERY, 2002, 6 (04) : 361 - 391
  • [34] High-Performance Commercial Data Mining: A Multistrategy Machine Learning Application
    William H. Hsu
    Michael Welge
    Tom Redman
    David Clutter
    Data Mining and Knowledge Discovery, 2002, 6 : 361 - 391
  • [35] Parallel and distributed processing for high resolution agricultural tomography based on big data
    Alves, Gabriel M.
    Cruvinel, Paulo E.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 10115 - 10146
  • [36] Parallel and distributed processing for high resolution agricultural tomography based on big data
    Gabriel M. Alves
    Paulo E. Cruvinel
    Multimedia Tools and Applications, 2024, 83 : 10115 - 10146
  • [37] Distributed High-Performance Parallel Mesh Generation with ViennaMesh
    Rodriguez, Jorge
    Weinbub, Josef
    Pahr, Dieter
    Rupp, Karl
    Selberherr, Siegfried
    APPLIED PARALLEL AND SCIENTIFIC COMPUTING (PARA 2012), 2013, 7782 : 548 - 552
  • [38] A Comprehensive Pre-processing Approach for High-Performance Classification of Twitter Data with several Machine Learning Algorithms
    Sarker, Ananya
    Islam, Md Rabiul
    Srizon, Azmain Yakin
    2020 IEEE REGION 10 SYMPOSIUM (TENSYMP) - TECHNOLOGY FOR IMPACTFUL SUSTAINABLE DEVELOPMENT, 2020, : 630 - 633
  • [39] Multilevel Data Processing Using Parallel Algorithms for Analyzing Big Data in High-Performance Computing
    Ahmad, Awais
    Paul, Anand
    Din, Sadia
    Rathore, M. Mazhar
    Choi, Gyu Sang
    Jeon, Gwanggil
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2018, 46 (03) : 508 - 527
  • [40] A Survey of Distributed and Parallel Extreme Learning Machine for Big Data
    Wang, Zhiqiong
    Sui, Ling
    Xin, Junchang
    Qu, Luxuan
    Yao, Yudong
    IEEE ACCESS, 2020, 8 : 201247 - 201258