Computing large-scale alignments on a multi-cluster

被引:0
|
作者
Chen, CX [1 ]
Schmidt, B [1 ]
机构
[1] Nanyang Technol Univ, Sch Comp Sci, Singapore 2263, Singapore
关键词
sequence alignment; grid computing; MPI; dynamic programming; cluster computing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Molecular biologists frequently align DNA sequences of entire genomes to detect important matched and mismatched regions. Even though efficient dynamic programming algorithms exist for this problem, the required computing time is still very high due to the size of these sequences (usually a few million base pairs in length). Because the number of sequenced organisms is increasing rapidly, fast and accurate solutions are of highest importance to research in this area. In this paper we present an algorithm to compute the optimal and near-optimal alignments of two sequences in linear space and quadratic time. We demonstrate how this algorithm can be parallelized efficiently on a PC cluster and on a computational grid in order to reduce its runtime significantly. The grid implementation uses a hierarchical approach combining inter-cluster and intra-cluster parallelism.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [1] Large-Scale Pairwise Sequence Alignments on a Large-Scale GPU Cluster
    Savran, Ibrahim
    Gao, Yang
    Bakos, Jason D.
    [J]. IEEE DESIGN & TEST, 2014, 31 (01) : 51 - 61
  • [2] Large-Scale Multi-Cluster MIMO Approach for Cognitive Radio Sensor Networks
    Hefnawi, Mostafa
    [J]. IEEE SENSORS JOURNAL, 2016, 16 (11) : 4418 - 4424
  • [3] ALIGNMENTS OF BRIGHTEST CLUSTER GALAXIES WITH LARGE-SCALE STRUCTURES
    LAMBAS, DG
    GROTH, EJ
    PEEBLES, PJE
    [J]. ASTRONOMICAL JOURNAL, 1988, 95 (04): : 996 - 998
  • [4] Large-scale alignments from WMAP and Planck
    Copi, Craig J.
    Huterer, Dragan
    Schwarz, Dominik J.
    Starkman, Glenn D.
    [J]. MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2015, 449 (04) : 3458 - 3470
  • [5] Multi-cluster computing interconnection network performance modeling and analysis
    Javadi, Bahman
    Akbari, Mohammad K.
    Abawajy, Jemal H.
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2009, 25 (07): : 737 - 746
  • [6] Analytical interconnection networks model for multi-cluster computing systems
    Javadi, Bahman
    Abawajy, Jemal H.
    Akbari, Mohammad K.
    [J]. ASMTA 2006: 13TH INTERNATIONAL CONFERENCE ON ANALYTICAL AND STOCHASTIC MODELLING TECHNIQUES AND APPLICATIONS, PROCEEDINGS, 2006, : 37 - 42
  • [7] Multi-cluster computing interconnection network performance modeling and analysis
    Javadi, Bahman
    Akbari, Mohammad K.
    Abawajy, Jemal H.
    Nahavandi, Sacid
    [J]. 2006 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, VOLS 1 AND 2, 2007, : 86 - +
  • [8] Dynamic parallel job scheduling in multi-cluster computing systems
    Abawajy, JH
    [J]. COMPUTATIONAL SCIENCE - ICCS 2004, PT 1, PROCEEDINGS, 2004, 3036 : 27 - 34
  • [9] Hierarchical Spark: A Multi-cluster Big Data Computing Framework
    Liu, Zixia
    Zhang, Hong
    Wang, Liqiang
    [J]. 2017 IEEE 10TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2017, : 90 - 97
  • [10] GeoSpark: A Cluster Computing Framework for Processing Large-Scale Spatial Data
    Yu, Jia
    Wu, Jinxuan
    Sarwat, Mohamed
    [J]. 23RD ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2015), 2015,