MapReduce-based RESTMD: Enabling Large-scale Sampling Tasks with Distributed HPC Systems

Cited by: 2

Authors:

Kondikoppa, Praveenkumar [1]

Platania, Richard [1]

Park, Seung-Jong [1]

Bai, Shuju [2]

Keyes, Tom [3]

Kim, Jaegil [1,3]

Kim, Nayong [1]

Kim, Joohyun [1]

Affiliations:

[1] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA

[2] Southern Univ, Dept Comp Sci, Baton Rouge, LA 70813 USA

[3] Boston Univ, Dept Chem, Boston, MA USA

Source:

2014 6TH INTERNATIONAL WORKSHOP ON SCIENCE GATEWAYS (IWSG) | 2014

Funding:

National Science Foundation (USA); National Institutes of Health (USA)

Keywords:

MOLECULAR-DYNAMICS SIMULATIONS; REPLICA EXCHANGE; MONTE-CARLO; FRAMEWORK

DOI:

10.1109/IWSG.2014.12

Chinese Library Classification:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

We present a novel implementation of Replica Exchange Statistical Temperature Molecular Dynamics (RESTMD), a generalized ensemble method of the replica exchange (parallel tempering) family. The implementation uses a MapReduce (MR)-based iterative framework to launch RESTMD on high performance computing (HPC) clusters, including our testbed system, Cyber-infrastructure for Reconfigurable Optical Networks (CRON), which simulates a network-connected distributed system. Our main contributions are a new implementation of STMD plugged into the well-known CHARMM molecular dynamics package and a Hadoop-powered RESTMD implementation that scales out effectively within a cluster and across distributed systems. To address the challenges of using Hadoop MapReduce, we examined the factors that affect the performance of the proposed framework through runtime analysis experiments on two biological systems of different sizes and on different types of HPC resources. The advantages of RESTMD suggest that it is effective for enhanced sampling, one of the grand challenges in areas ranging from chemical systems to statistical inference. Lastly, with its scale-across capacity over distributed computing infrastructure (DCI) and its use of Hadoop for coarse-grained task-level parallelism, MapReduce-based RESTMD is a good example of the next generation of applications that science gateway projects increasingly demand, in particular those backed by IaaS clouds.

Pages: 30-35

Page count: 6
