Bwasw-Cloud: Efficient Sequence Alignment Algorithm for Two Big Data with MapReduce

被引:0
|
作者
Sun, Mingming [1 ]
Zhou, Xuehai [1 ]
Yang, Feng [1 ]
Lu, Kun [1 ]
Dai, Dong [2 ]
机构
[1] Univ Sci & Technol China, Comp Sci, Hefei 230026, Peoples R China
[2] Texas Tech Univ, Comp Sci, Lubbock, TX 79409 USA
基金
中国博士后科学基金; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The recent next-generation sequencing machines generate sequences at an unprecedented rate, and a sequence is not short any more called read. The reference sequences which are aligned reads against are also increasingly large. Efficiently mapping large number of long sequences with big reference sequences poses a new challenge to sequence alignment. Sequence alignment algorithms become to match on two big data. To address the above problem, we propose a new parallel sequence alignment algorithm called Bwasw-Cloud, optimized for aligning long reads against a large sequence data (e.g. the human genome). It is modeled after the widely used BWA-SW algorithm and uses the open-source Hadoop implementation of Map Reduce. The results show that Bwasw-Cloud can effectively and quickly match two big data in common cluster.
引用
收藏
页码:213 / 218
页数:6
相关论文
共 50 条
  • [1] Efficient Alignment of Next Generation Sequencing Data Using MapReduce on the Cloud
    AlSaad, Rawan
    Malluhi, Qutaibah
    Abouelhoda, Mohamed
    2012 CAIRO INTERNATIONAL BIOMEDICAL ENGINEERING CONFERENCE (CIBEC), 2012, : 18 - 22
  • [2] An accurate algorithm for multiple sequence alignment in MapReduce
    Dong, Gaifang
    Fu, Xueliang
    Li, Honghui
    Li, Jianrong
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2018, 18 (01) : 283 - 295
  • [3] Utilizing the Buckshot Algorithm for Efficient Big Data Clustering in the MapReduce Model
    Gerakidis, Sergios
    Mamalis, Basilis
    PROCEEDINGS OF THE 23RD PAN-HELLENIC CONFERENCE OF INFORMATICS (PCI 2019), 2019, : 112 - 117
  • [4] Cross-Cloud MapReduce for Big Data
    Li, Peng
    Guo, Song
    Yu, Shui
    Zhuang, Weihua
    IEEE TRANSACTIONS ON CLOUD COMPUTING, 2020, 8 (02) : 375 - 386
  • [5] Efficient Big Data Processing in Hadoop MapReduce
    Dittrich, Jens
    Quiane-Ruiz, Jorge-Arnulfo
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (12): : 2014 - 2015
  • [6] Study on Cloud Storage based on the MapReduce for Big Data
    Huang Yi
    Ma Xinqiang
    Zhang Yongdan
    Liu Youyuan
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2015, 8 : 1601 - 1605
  • [7] GreeDi: An energy efficient routing algorithm for big data on cloud
    Baker, T.
    Al-Dawsari, B.
    Tawfik, H.
    Reid, D.
    Ngoko, Y.
    AD HOC NETWORKS, 2015, 35 : 83 - 96
  • [8] Handling Big Data Using MapReduce Over Hybrid Cloud
    Saxena, Ankur
    Chaurasia, Ankur
    Kaushik, Neeraj
    Kaushik, Nidhi
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, VOL 2, 2019, 56 : 135 - 144
  • [9] An efficient sequence alignment algorithm on a LARPBS
    Seme, David
    Youlou, Sidney
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2007, PT 3, PROCEEDINGS, 2007, 4707 : 379 - +
  • [10] An Efficient Algorithm for Local Sequence Alignment
    Haque, Waqar
    Aravind, Alex
    Reddy, Bharath
    2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, : 1367 - 1372