MapReduce-based RESTMD: Enabling Large-scale Sampling Tasks with Distributed HPC Systems

被引:2
|
作者
Kondikoppa, Praveenkumar [1 ]
Platania, Richard [1 ]
Park, Seung-Jong [1 ]
Bai, Shuju [2 ]
Keyes, Tom [3 ]
Kim, Jaegil [1 ,3 ]
Kim, Nayong [1 ]
Kim, Joohyun [1 ]
机构
[1] Louisiana State Univ, Ctr Computat & Technol, Baton Rouge, LA 70803 USA
[2] Southern Univ, Dept Comp Sci, Baton Rouge, LA 70813 USA
[3] Boston Univ, Dept Chem, Boston, MA USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
MOLECULAR-DYNAMICS SIMULATIONS; REPLICA EXCHANGE; MONTE-CARLO; FRAMEWORK;
D O I
10.1109/IWSG.2014.12
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel implementation of Replica Exchange Statistical Temperature Molecular Dynamics (RESTMD), belonging to a generalized ensemble method and also known as parallel tempering, is presented. Our implementation employs the Map-Reduce (MR)-based iterative framework for launching RESTMD over high performance computing (HPC) clusters including our testbed system, Cyber-infrastructure for Reconfigurable Optical Networks (CRON) simulating a network-connected distributed system. Our main contribution is a new implementation of STMD plugged into the well-known CHARMM molecular dynamics package as well as the RESTMD implementation powered by the Hadoop that scales out in a cluster and across distributed systems effectively. To address challenges for the use of Hadoop MapReduce, we examined contributing factors on the performance of the proposed framework with various runtime analysis experiments with two biological systems that differ in size and over different types of HPC resources. Many advantages with the use of RESTMD suggest its effectiveness for enhanced sampling, one of grand challenges in a variety of areas of studies ranging from chemical systems to statistical inference. Lastly, with its support for scale-across capacity over distributed computing infrastructure (DCI) and the use of Hadoop for coarse-grained task-level parallelism, MapReduce-based RESTMD represents truly a good example of the next-generation of applications whose provision is increasingly becoming demanded by science gateway projects, in particular, backed by IaaS clouds.
引用
收藏
页码:30 / 35
页数:6
相关论文
共 50 条
  • [1] MELT: Mapreduce-based Efficient Large-scale Trajectory Anonymization
    Ward, Katrina
    Lin, Dan
    Madria, Sanjay
    [J]. SSDBM 2017: 29TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, 2017,
  • [2] MapReduce-based Dragonfly Algorithm for large-scale Data-Clustering
    Tripathi, Ashish Kumar
    Saxena, Pranav
    Gupta, Siddharth
    [J]. 2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 171 - 175
  • [3] ARLS: A MapReduce-based output analysis tool for large-scale simulations
    Lee, Kangsun
    Jung, Kwanghoon
    Park, Joonho
    Kwon, Dongseop
    [J]. ADVANCES IN ENGINEERING SOFTWARE, 2016, 95 : 28 - 37
  • [4] A MapReduce-based artificial bee colony for large-scale data clustering
    Banharnsakun, Anan
    [J]. PATTERN RECOGNITION LETTERS, 2017, 93 : 78 - 84
  • [5] Enabling Parallel Simulation of Large-Scale HPC Network Systems
    Mubarak, Misbah
    Carothers, Christopher D.
    Ross, Robert B.
    Carns, Philip
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2017, 28 (01) : 87 - 100
  • [6] A MapReduce-based approach for shortest path problem in large-scale networks
    Aridhi, Sabeur
    Lacomme, Philippe
    Ren, Libo
    Vincent, Benjamin
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2015, 41 : 151 - 165
  • [7] Aeromancer: A Workflow Manager for Large-Scale MapReduce-Based Scientific Workflows
    Mohamed, Nabeel
    Maji, Nabanita
    Zhang, Jing
    Timoshevskaya, Nataliya
    Feng, Wu-Chun
    [J]. 2014 IEEE 13TH INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM), 2014, : 739 - 746
  • [8] A Large-Scale Implementation Using MapReduce-Based SVM for Tweets Sentiment Analysis
    Lijo, V. P.
    Seetha, Hari
    [J]. INTELLIGENT COMPUTING AND COMMUNICATION, ICICC 2019, 2020, 1034 : 541 - 549
  • [9] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Ahmed Sharafeldeen
    Mohammed Alrahmawy
    Samir Elmougy
    [J]. Scientific Reports, 13
  • [10] Graph partitioning MapReduce-based algorithms for counting triangles in large-scale graphs
    Sharafeldeen, Ahmed
    Alrahmawy, Mohammed
    Elmougy, Samir
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)