Big data storage technologies: a survey

被引:55
|
作者
Siddiqa, Aisha [1 ]
Karim, Ahmad [2 ]
Gani, Abdullah [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
[2] Bahauddin Zakariya Univ, Dept Informat Technol, Multan 60000, Pakistan
关键词
Big data; Big data storage; NoSQL databases; Distributed databases; CAP theorem; Scalability; Consistency-partition resilience; Availability-partition resilience; DATA REPLICATION; NOSQL DATABASES; COMMUNICATION; AVAILABILITY; SCALABILITY; CHALLENGES; SYSTEMS; CAP;
D O I
10.1631/FITEE.1500441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a great thrust in industry toward the development of more feasible and viable tools for storing fast-growing volume, velocity, and diversity of data, termed 'big data'. The structural shift of the storage mechanism from traditional data management systems to NoSQL technology is due to the intention of fulfilling big data storage requirements. However, the available big data storage technologies are inefficient to provide consistent, scalable, and available solutions for continuously growing heterogeneous data. Storage is the preliminary process of big data analytics for real-world applications such as scientific experiments, healthcare, social networks, and e-business. So far, Amazon, Google, and Apache are some of the industry standards in providing big data storage solutions, yet the literature does not report an in-depth survey of storage technologies available for big data, investigating the performance and magnitude gains of these technologies. The primary objective of this paper is to conduct a comprehensive investigation of state-of-the-art storage technologies available for big data. A well-defined taxonomy of big data storage technologies is presented to assist data analysts and researchers in understanding and selecting a storage mechanism that better fits their needs. To evaluate the performance of different storage architectures, we compare and analyze the existing approaches using Brewer's CAP theorem. The significance and applications of storage technologies and support to other categories are discussed. Several future research challenges are highlighted with the intention to expedite the deployment of a reliable and scalable storage system.
引用
收藏
页码:1040 / 1070
页数:31
相关论文
共 50 条
  • [1] Big data storage technologies: a survey
    Aisha Siddiqa
    Ahmad Karim
    Abdullah Gani
    [J]. Frontiers of Information Technology & Electronic Engineering, 2017, 18 : 1040 - 1070
  • [2] Big Data technologies: A survey
    Oussous, Ahmed
    Benjelloun, Fatima-Zahra
    Ait Lahcen, Ayoub
    Belfkih, Samir
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2018, 30 (04) : 431 - 448
  • [3] Data storage in Big Data Context: A Survey
    ELomari, A.
    Maizate, A.
    Hassouni, L.
    [J]. PROCEEDINGS OF 2016 THIRD INTERNATIONAL CONFERENCE ON SYSTEMS OF COLLABORATION (SYSCO), 2016, : P107 - P110
  • [4] A Survey on Big Data Storage Strategies
    Gazal
    Kaur, Pankaj Deep
    [J]. 2015 International Conference on Green Computing and Internet of Things (ICGCIoT), 2015, : 280 - 284
  • [5] Survey of Research on Big Data Storage
    Zhang, Xiaoxue
    Xu, Feng
    [J]. 2013 12TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING & SCIENCE (DCABES), 2013, : 76 - 80
  • [6] Elaborative Survey on Storage Technologies for XML Big Data: A Real-time Approach
    Sankari, S.
    Bose, S.
    [J]. 2016 5TH INTERNATIONAL CONFERENCE ON RECENT TRENDS IN INFORMATION TECHNOLOGY (ICRTIT), 2016,
  • [7] Emergent Technologies in Big Data Sensing: A Survey
    Zhu, Ting
    Xiao, Sheng
    Zhang, Qingquan
    Gu, Yu
    Yi, Ping
    Li, Yanhua
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [8] Big Data: Survey, Technologies, Opportunities, and Challenges
    Khan, Nawsher
    Yaqoob, Ibrar
    Hashem, Ibrahim Abaker Targio
    Inayat, Zakira
    Ali, Waleed KamaleldinMahmoud
    Alam, Muhammad
    Shiraz, Muhammad
    Gani, Abdullah
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [9] BLAST Using Big Data Technologies: A Survey
    Gaikwad, Mayur
    Ahirrao, Swati
    [J]. 2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [10] A Survey of Different Technologies and Recent Challenges of Big Data
    Dev, Dipayan
    Patgiri, Ripon
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS, ICACNI 2015, VOL 2, 2016, 44 : 537 - 548