MapReduce-based RESTMD: Enabling Large-scale Sampling Tasks with Distributed HPC Systems

被引:2
|
作者
Kondikoppa, Praveenkumar [1 ]
Platania, Richard [1 ]
Park, Seung-Jong [1 ]
Bai, Shuju [2 ]
Keyes, Tom [3 ]
Kim, Jaegil [1 ,3 ]
Kim, Nayong [1 ]
Kim, Joohyun [1 ]
机构
[1] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
[2] Southern Univ, Dept Comp Sci, Baton Rouge, LA 70813 USA
[3] Boston Univ, Dept Chem, Boston, MA USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
MOLECULAR-DYNAMICS SIMULATIONS; REPLICA EXCHANGE; MONTE-CARLO; FRAMEWORK;
D O I
10.1109/IWSG.2014.12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel implementation of Replica Exchange Statistical Temperature Molecular Dynamics (RESTMD), belonging to a generalized ensemble method and also known as parallel tempering, is presented. Our implementation employs the Map-Reduce (MR)-based iterative framework for launching RESTMD over high performance computing (HPC) clusters including our testbed system, Cyber-infrastructure for Reconfigurable Optical Networks (CRON) simulating a network-connected distributed system. Our main contribution is a new implementation of STMD plugged into the well-known CHARMM molecular dynamics package as well as the RESTMD implementation powered by the Hadoop that scales out in a cluster and across distributed systems effectively. To address challenges for the use of Hadoop MapReduce, we examined contributing factors on the performance of the proposed framework with various runtime analysis experiments with two biological systems that differ in size and over different types of HPC resources. Many advantages with the use of RESTMD suggest its effectiveness for enhanced sampling, one of grand challenges in a variety of areas of studies ranging from chemical systems to statistical inference. Lastly, with its support for scale-across capacity over distributed computing infrastructure (DCI) and the use of Hadoop for coarse-grained task-level parallelism, MapReduce-based RESTMD represents truly a good example of the next-generation of applications whose provision is increasingly becoming demanded by science gateway projects, in particular, backed by IaaS clouds.
引用
收藏
页码:30 / 35
页数:6
相关论文
共 50 条
  • [31] TVD-MRDL: traffic violation detection system using MapReduce-based deep learning for large-scale data
    Shiva Asadianfam
    Mahboubeh Shamsi
    Abdolreza Rasouli Kenari
    [J]. Multimedia Tools and Applications, 2021, 80 : 2489 - 2516
  • [32] TVD-MRDL: traffic violation detection system using MapReduce-based deep learning for large-scale data
    Asadianfam, Shiva
    Shamsi, Mahboubeh
    Kenari, Abdolreza Rasouli
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (02) : 2489 - 2516
  • [33] One-pass MapReduce-based clustering method for mixed large scale data
    Ben HajKacem, Mohamed Aymen
    Ben N'cir, Chiheb-Eddine
    Essoussi, Nadia
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2019, 52 (03) : 619 - 636
  • [34] One-pass MapReduce-based clustering method for mixed large scale data
    Mohamed Aymen Ben HajKacem
    Chiheb-Eddine Ben N’cir
    Nadia Essoussi
    [J]. Journal of Intelligent Information Systems, 2019, 52 : 619 - 636
  • [35] Enabling reuse-based software development of large-scale systems
    Selby, RW
    [J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2005, 31 (06) : 495 - 510
  • [36] Interactive Rendering for Large-Scale Mesh Based on MapReduce
    Zhang, Hongxin
    Zhu, Biao
    Chen, Wei
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN AND COMPUTER GRAPHICS (CAD/GRAPHICS), 2013, : 345 - 352
  • [37] MRA*: Parallel and Distributed Path in Large-Scale Graph Using MapReduce-A* Based Approach
    Hamilton Adoni, Wilfried Yves
    Nahhal, Tarik
    Aghezzaf, Brahim
    Elbyed, Abdeltif
    [J]. UBIQUITOUS NETWORKING, UNET 2017, 2017, 10542 : 390 - 401
  • [38] Component-based design of large-scale distributed systems
    Barbier, F
    [J]. 25TH ANNUAL INTERNATIONAL COMPUTER SOFTWARE & APPLICATIONS CONFERENCE, 2001, : 19 - 24
  • [39] Performance Optimization of HPC Applications in Large-Scale Cluster Systems
    Li, Longxiang
    [J]. PROCEEDINGS OF THE 2022 ACM/SPEC INTERNATIONAL CONFERENCE ON PERFORMANCE ENGINEERING (ICPE '22), 2022, : 3 - 3
  • [40] Time-Sharing Redux for Large-scale HPC Systems
    Hofmeyr, Steven
    Iancu, Costin
    Colmenares, Juan A.
    Roman, Eric
    Austin, Brian
    [J]. PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2016, : 301 - 308