Bwasw-Cloud: Efficient Sequence Alignment Algorithm for Two Big Data with MapReduce

被引:0
|
作者
Sun, Mingming [1 ]
Zhou, Xuehai [1 ]
Yang, Feng [1 ]
Lu, Kun [1 ]
Dai, Dong [2 ]
机构
[1] Univ Sci & Technol China, Comp Sci, Hefei 230026, Peoples R China
[2] Texas Tech Univ, Comp Sci, Lubbock, TX 79409 USA
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent next-generation sequencing machines generate sequences at an unprecedented rate, and a sequence is not short any more called read. The reference sequences which are aligned reads against are also increasingly large. Efficiently mapping large number of long sequences with big reference sequences poses a new challenge to sequence alignment. Sequence alignment algorithms become to match on two big data. To address the above problem, we propose a new parallel sequence alignment algorithm called Bwasw-Cloud, optimized for aligning long reads against a large sequence data (e.g. the human genome). It is modeled after the widely used BWA-SW algorithm and uses the open-source Hadoop implementation of Map Reduce. The results show that Bwasw-Cloud can effectively and quickly match two big data in common cluster.
引用
收藏
页码:213 / 218
页数:6
相关论文
共 50 条
  • [41] A Crowdsourcing Worker Quality Evaluation Algorithm on MapReduce for Big Data Applications
    Dang, Depeng
    Liu, Ying
    Zhang, Xiaoran
    Huang, Shihang
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (07) : 1879 - 1888
  • [42] Trusted cloud computing platform poly source big data time sequence scheduling algorithm
    Du R.-S.
    Chen Y.-X.
    Meng L.-D.
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2023, 53 (11): : 3194 - 3200
  • [43] Hybrid Data Mining Algorithm in Cloud Computing using MapReduce Framework
    Sahay, Siddharth
    Khetarpal, Suruchi
    Pradhan, Tribikram
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2016, : 507 - 511
  • [44] A More Efficient and Effective Heuristic Algorithm for the MapReduce Placement Problem in Cloud Computing
    Xu, Xiaoyong
    Tang, Maolin
    2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 264 - 271
  • [45] An enhanced and efficient clustering algorithm for large data using MapReduce
    Li, Hongbiao
    Liu, Ruiying
    Wang, Jingdong
    Wu, Qilong
    IAENG International Journal of Computer Science, 2019, 46 (01)
  • [46] Hybrid Parallel Linguistic Fuzzy Rules with Canopy MapReduce for Big Data Classification in Cloud
    Vennila, V.
    Kannan, A. Rajiv
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2019, 21 (03) : 809 - 822
  • [47] Privacy Preserving Over Big Data Through VSSFA and MapReduce Framework in Cloud Environment
    Thiyagarajan, V. S.
    Ayyasamy, A.
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 97 (04) : 6239 - 6263
  • [48] Privacy Preserving Over Big Data Through VSSFA and MapReduce Framework in Cloud Environment
    V. S. Thiyagarajan
    A. Ayyasamy
    Wireless Personal Communications, 2017, 97 : 6239 - 6263
  • [49] Hybrid Parallel Linguistic Fuzzy Rules with Canopy MapReduce for Big Data Classification in Cloud
    V. Vennila
    A. Rajiv Kannan
    International Journal of Fuzzy Systems, 2019, 21 : 809 - 822
  • [50] A MapReduce Based Approach of Scalable Multidimensional Anonymization for Big Data Privacy Preservation on Cloud
    Zhang, Xuyun
    Yang, Chi
    Nepal, Surya
    Liu, Chang
    Dou, Wanchun
    Chen, Jinjun
    2013 IEEE THIRD INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING (CGC 2013), 2013, : 105 - 112