Large-scale DNA sequence assembly by using computing grid

Cited by: 0
Authors
Fang, Xiaoyong [1]
Luo, Zhigang [1]
Wang, Zhenghua [1]
Ding, Fan [1]
Affiliation
[1] Natl Univ Def Technol, Natl Lab Parallel & Distributed Proc, Changsha 410073, Hunan, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
DNA sequence assembly is a fundamental part of biological computing. However, most large-scale sequence assemblies require intensive computing power and huge storage. To speed up the assembly process, we propose a method for large-scale DNA sequence assembly using a computing grid. The central idea of our method is to first cluster the input fragment set into many disjoint subsets using k-mers, and then to distribute these subsets across all nodes of the grid-computing system. On the test data sets, under a simulated grid-computing system, our method achieves an accuracy of more than 92% while requiring less time and less storage. Our method can efficiently process large-scale DNA sequence assembly by taking advantage of the huge storage and computing capacity of the computing grid.
Pages: 397 / +
Number of pages: 2
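
The two steps named in the abstract (clustering fragments into disjoint subsets using k-mers, then distributing the subsets to grid nodes) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their clustering rule or scheduling policy. The sketch assumes that two fragments belong to the same subset when they share at least one k-mer (so subsets are connected components, found with union-find), that reverse complements are ignored, and that subsets are assigned greedily to the least-loaded node. The names `kmers`, `cluster_fragments`, and `assign_to_nodes` are hypothetical.

```python
from collections import defaultdict


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def cluster_fragments(fragments, k):
    """Group fragment indices into disjoint clusters.

    Assumption (not from the paper): two fragments are linked if they
    share at least one k-mer; clusters are the connected components.
    """
    parent = list(range(len(fragments)))

    def find(i):
        # Union-find lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    first_seen = {}  # k-mer -> index of the first fragment containing it
    for idx, frag in enumerate(fragments):
        for km in kmers(frag, k):
            if km in first_seen:
                ra, rb = find(idx), find(first_seen[km])
                if ra != rb:
                    parent[ra] = rb  # merge the two clusters
            else:
                first_seen[km] = idx

    clusters = defaultdict(list)
    for idx in range(len(fragments)):
        clusters[find(idx)].append(idx)
    return sorted(clusters.values())


def assign_to_nodes(clusters, fragments, n_nodes):
    """Assign each cluster, largest first, to the least-loaded node.

    Load is measured in total bases; this greedy policy is an assumption.
    """
    def size(cluster):
        return sum(len(fragments[i]) for i in cluster)

    loads = [0] * n_nodes
    plan = [[] for _ in range(n_nodes)]
    for cluster in sorted(clusters, key=size, reverse=True):
        node = loads.index(min(loads))
        plan[node].append(cluster)
        loads[node] += size(cluster)
    return plan


if __name__ == "__main__":
    reads = ["ACGTACGT", "TACGTTTT", "GGGGCCCC", "CCCCAAAA", "TTTTTTTT"]
    subsets = cluster_fragments(reads, k=4)
    print(subsets)
    print(assign_to_nodes(subsets, reads, n_nodes=2))
```

Because the subsets share no k-mers, each node could in principle assemble its subsets independently. In practice a single shared k-mer is a weak criterion (repeats can merge unrelated fragments into one large subset), so a real system would likely require a minimum number of shared k-mers or filter high-frequency k-mers first.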