High-performance medical data processing technology based on distributed parallel machine learning algorithm

Cited by: 2
Authors
Liu, Ji [1]
Liang, Xiao [2]
Ruan, Wenxi [3]
Zhang, Bo [4]
Affiliations
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2000, Australia
[2] Shanxi VC PE Fund Management Co Ltd, Taiyuan 030000, Shanxi, Peoples R China
[3] Taizhou Vocat Coll Sci & Technol, Taizhou 318020, Peoples R China
[4] China Asset Management Co Ltd, Beijing 100033, Peoples R China
Source
JOURNAL OF SUPERCOMPUTING | 2022 / Vol. 78 / Issue 04
Keywords
Adaptive density peak clustering algorithm; Random forest algorithm; Distributed parallel classification algorithm; Cloud computing; Segmentation; Images
DOI
10.1007/s11227-021-04060-4
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This work aims to improve the efficiency of medical data processing and to establish a sound medical data management system. To apply distributed parallel classification algorithms to intelligent hospital guidance, a Parallel Random Forest (PRF) classification algorithm is proposed based on the Apache Spark cloud computing platform. To address the loss of sparse clusters in data sets with variable density distributions, an Adaptive Domain Density Peak Clustering (ADDPC) method is proposed. In addition, a Bilayer Parallel Training-Convolutional Neural Network (BPT-CNN) model based on distributed computing is proposed to detect and classify colon cancer nuclei more accurately through a large-scale parallel deep learning (DL) algorithm. The performance of the proposed models is then evaluated through case analysis. The results show that the PRF algorithm, running on a distributed cloud computing platform, can independently design data-parallel tasks, thereby optimizing data communication cost and efficiency. The ADDPC algorithm can adaptively measure domain density and merge sparse clusters to prevent data loss and fragmentation. The BPT-CNN model improves the performance of the algorithm and balances the workload of each of its tasks. The results offer significant reference value for solving problems in medical data processing.
Pages: 5933 - 5956
Page count: 24
Related papers
50 in total
  • [41] Multilevel Data Processing Using Parallel Algorithms for Analyzing Big Data in High-Performance Computing
    Awais Ahmad
    Anand Paul
    Sadia Din
    M. Mazhar Rathore
    Gyu Sang Choi
    Gwanggil Jeon
    International Journal of Parallel Programming, 2018, 46 : 508 - 527
  • [42] A Concept of Smart Medical Autonomous Distributed System for Diagnostics Based on Machine Learning Technology
    Velichko, Elena
    Nepomnyashchaya, Elina
    Baranov, Maxim
    Galeeva, Marina A.
    Pavlov, Vitalii A.
    Zavjalov, Sergey V.
    Savchenko, Ekaterina
    Pervunina, Tatiana M.
    Govorov, Igor
    Komlichenko, Eduard
    INTERNET OF THINGS, SMART SPACES, AND NEXT GENERATION NETWORKS AND SYSTEMS, NEW2AN 2019, RUSMART 2019, 2019, 11660 : 515 - 524
  • [43] A Parallel, Distributed, High-Performance Architecture for Simulating Particle-based Models
    Sabou, Adrian
    Gorgan, Dorian
    16TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2014), 2014, : 500 - 507
  • [44] Application of Parallel Distributed Genetics-based Machine Learning to Imbalanced Data Sets
    Nojima, Yusuke
    Mihara, Shingo
    Ishibuchi, Hisao
    2012 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2012,
  • [45] Redundant data high-efficiency compression based on distributed parallel algorithm
    Gong, Jianhu
    SOFT COMPUTING, 2021, 25 (20) : 13039 - 13051
  • [46] High-performance federated continual learning algorithm for heterogeneous streaming data
    Jiang H.
    He T.
    Liu M.
    Sun S.
    Wang Y.
    Tongxin Xuebao/Journal on Communications, 2023, 44 (05): : 123 - 136
  • [47] Image Processing Technology Based on Machine Learning
    Qiao, Qiong
    IEEE CONSUMER ELECTRONICS MAGAZINE, 2024, 13 (04) : 90 - 99
  • [48] A High-Performance, Pipelined, FPGA-Based Genetic Algorithm Machine
    Barry Shackleford
    Greg Snider
    Richard J. Carter
    Etsuko Okushi
    Mitsuhiro Yasuda
    Katsuhiko Seo
    Hiroto Yasuura
    Genetic Programming and Evolvable Machines, 2001, 2 (1) : 33 - 60
  • [49] A Scalable, High-Performance, and Fault-Tolerant Network Architecture for Distributed Machine Learning
    Wang, Songtao
    Li, Dan
    Cheng, Yang
    Geng, Jinkun
    Wang, Yanshu
    Wang, Shuai
    Xia, Shutao
    Wu, Jianping
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2020, 28 (04) : 1752 - 1764
  • [50] High-Performance Mobility Simulation: Implementation of a Parallel Distributed Message-Passing Algorithm for MATSim
    Laudan, Janek
    Heinrich, Paul
    Nagel, Kai
    INFORMATION, 2025, 16 (02)