An automated parallel genetic algorithm with parametric adaptation for distributed data analysis

被引:0
|
作者
Al-Terkawi, Laila [1 ]
Migliavacca, Matteo [2 ]
机构
[1] Int Univ Kuwait IUK, Al Ardiya, Kuwait
[2] Univ Kent, Sch Comp, Canterbury, England
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
关键词
Genetic algorithms (GAs); GAs parameter control; Classification; Large-scale data processing; Spark; CONVERGENCE;
D O I
10.1038/s41598-025-93943-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Unleashing the potential of large-scale data analysis requires advanced computational methods capable of managing the immense size and complexity of distributed data. Genetic algorithms (GAs), known for their adaptability, benefit significantly from parallelization, prompting ongoing enhancements to boost performance further. This study proposes the integration of automatic termination and population sizing mechanisms into parallel GAs to augment their flexibility and effectiveness. We extend PDMS-BioHEL and PDMD-BioHEL, two parallel GA-based classifiers implemented on the Spark platform, and through extensive experimentation, demonstrate the efficacy of our approach in enhancing computational efficiency and user-friendliness. However, while these automated strategies significantly reduce the need for manual parameter tuning, thereby increasing time efficiency, they may sometimes lead to a slight reduction in final solution accuracy, particularly under complex scenario conditions. This trade-off between efficiency and accuracy is critical, especially when high precision is paramount. Our techniques enable more efficient and effective large-scale data analysis using parallel GAs, providing a robust foundation for future advancements and inviting further investigation into balancing these aspects.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Parallel genetic algorithm with parameter adaptation
    Tongchim, S
    Chongstitvatana, P
    INFORMATION PROCESSING LETTERS, 2002, 82 (01) : 47 - 54
  • [2] Efficient distributed mesh data structure for parallel automated adaptive analysis
    E. Seegyoung Seol
    Mark S. Shephard
    Engineering with Computers, 2006, 22 : 197 - 213
  • [3] Efficient distributed mesh data structure for parallel automated adaptive analysis
    Seol, E. Seegyoung
    Shephard, Mark S.
    ENGINEERING WITH COMPUTERS, 2006, 22 (3-4) : 197 - 213
  • [4] The Design and Analysis of an Improved Parallel Genetic Algorithm Based on Distributed System
    Chen, Yan
    Li, Zhimei
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRONIC & MECHANICAL ENGINEERING AND INFORMATION TECHNOLOGY (EMEIT-2012), 2012, 23
  • [5] A parallel Genetic Algorithm with distributed environment scheme
    Kaneko, M
    Miki, M
    Hiroyasu, T
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, 2000, : 619 - 625
  • [6] Cloud Parallel Genetic Algorithm for Gene Microarray Data Analysis
    Palomino, Rommel A. Benites
    Liang, Lily R.
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 932 - 933
  • [7] Analysis on parallel genetic algorithm
    Zeng, G.S.
    Ding, C.L.
    Jisuanji Gongcheng/Computer Engineering, 2001, 27 (09):
  • [8] Distributed Genetic Algorithm using Automated Adaptive Migration
    Lee, Hyunjung
    Oh, Byonghwa
    Yang, Jihoon
    Kim, Seonho
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 1835 - 1840
  • [9] A parallel and distributed Genetic Algorithm for the Traveling Salesman Problem
    Sena, G
    Isern, G
    Megherbi, D
    PROCEEDINGS OF THE HIGH PERFORMANCE COMPUTING SYMPOSIUM - HPC '99, 1999, : 319 - 324
  • [10] Effects of chromosome migration on a parallel and distributed genetic algorithm
    Matsumura, T
    Nakamura, M
    Miyazato, D
    Onaga, K
    Okech, J
    THIRD INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS, AND NETWORKS, PROCEEDINGS (I-SPAN '97), 1997, : 357 - 361