Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges

被引:36
|
作者
Yin, Zekun [1 ]
Lan, Haidong [1 ]
Tan, Guangming [2 ]
Lu, Mian [3 ]
Vasilakos, Athanasios V. [4 ]
Liu, Weiguo [1 ]
机构
[1] Shandong Univ, Jinan, Shandong, Peoples R China
[2] Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
[3] Huawei Singapore Res Ctr, Singapore, Singapore
[4] Lulea Univ Technol, Dept Comp Sci Elect & Space Engn, SE-93187 Skelleftea, Sweden
关键词
Computational biology applications; Computing platforms; Big biological data; NGS; GPU; Intel MIC; MULTIPLE SEQUENCE ALIGNMENT; READ ALIGNMENT; SPEED-UP; PARALLEL; IMPLEMENTATION; ALGORITHM; SEARCH; CUDA; ACCELERATION; CLUSTALW;
D O I
10.1016/j.csbj.2017.07.004
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and computer scientists are facing the challenge of gaining a profound insight into the deepest biological functions from big biological data. This in turn requires massive computational resources. Therefore, high performance computing (HPC) platforms are highly needed as well as efficient and scalable algorithms that can take advantage of these platforms. In this paper, we survey the state-of-the-art HPC platforms for big biological data analytics. We first list the characteristics of big biological data and popular computing platforms. Then we provide a taxonomy of different biological data analysis applications and a survey of the way they have been mapped onto various computing platforms. After that, we present a case study to compare the efficiency of different computing platforms for handling the classical biological sequence alignment problem. At last we discuss the open issues in big biological data analytics. (C) 2017 The Authors. Published by Elsevier B.V.
引用
下载
收藏
页码:403 / 411
页数:9
相关论文
共 50 条
  • [31] Does big data mean big knowledge? KM perspectives on big data and analytics
    Pauleen, David J.
    Wang, William Y. C.
    JOURNAL OF KNOWLEDGE MANAGEMENT, 2017, 21 (01) : 1 - 6
  • [32] Big Data Analytics: Key Technologies and Challenges
    Hu Shuijing
    2016 INTERNATIONAL CONFERENCE ON ROBOTS & INTELLIGENT SYSTEM (ICRIS), 2016, : 141 - 145
  • [33] Big Data Analytics in Education: Prospects and Challenges
    Hossain, Syed Akhter
    2015 4TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (ICRITO) (TRENDS AND FUTURE DIRECTIONS), 2015,
  • [34] Big Data Analytics for Security and Privacy Challenges
    Mishra, Aditya Dev
    Singh, Youddha Beer
    2016 IEEE INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND AUTOMATION (ICCCA), 2016, : 50 - 53
  • [35] Challenges of Feature Selection for Big Data Analytics
    Li, Jundong
    Liu, Huan
    IEEE INTELLIGENT SYSTEMS, 2017, 32 (02) : 9 - 15
  • [36] Big Data Analytics: Security and Privacy Challenges
    Gahi, Youssef
    Guennoun, Mouhcine
    Mouftah, Hussein T.
    2016 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATION (ISCC), 2016, : 952 - 957
  • [37] Challenges of Feature Selection for Big Data Analytics
    Li J.
    Liu H.
    1600, Institute of Electrical and Electronics Engineers Inc., United States (32): : 9 - 15
  • [38] Challenges of Privacy Protection in Big Data Analytics
    Jensen, Meiko
    2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA, 2013, : 235 - 238
  • [39] Challenges in Big Data Analytics Techniques: A Survey
    Komalavalli, C.
    Laroiya, Chetna
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 223 - 228
  • [40] Big Data Processing in the Cloud - Challenges and Platforms
    Zhelev, Svetoslav
    Rozeva, Anna
    PROCEEDINGS OF THE 43RD INTERNATIONAL CONFERENCE APPLICATIONS OF MATHEMATICS IN ENGINEERING AND ECONOMICS (AMEE'17), 2017, 1910