Big data storage technologies: a survey

被引:55
|
作者
Siddiqa, Aisha [1 ]
Karim, Ahmad [2 ]
Gani, Abdullah [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
[2] Bahauddin Zakariya Univ, Dept Informat Technol, Multan 60000, Pakistan
关键词
Big data; Big data storage; NoSQL databases; Distributed databases; CAP theorem; Scalability; Consistency-partition resilience; Availability-partition resilience; DATA REPLICATION; NOSQL DATABASES; COMMUNICATION; AVAILABILITY; SCALABILITY; CHALLENGES; SYSTEMS; CAP;
D O I
10.1631/FITEE.1500441
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is a great thrust in industry toward the development of more feasible and viable tools for storing fast-growing volume, velocity, and diversity of data, termed 'big data'. The structural shift of the storage mechanism from traditional data management systems to NoSQL technology is due to the intention of fulfilling big data storage requirements. However, the available big data storage technologies are inefficient to provide consistent, scalable, and available solutions for continuously growing heterogeneous data. Storage is the preliminary process of big data analytics for real-world applications such as scientific experiments, healthcare, social networks, and e-business. So far, Amazon, Google, and Apache are some of the industry standards in providing big data storage solutions, yet the literature does not report an in-depth survey of storage technologies available for big data, investigating the performance and magnitude gains of these technologies. The primary objective of this paper is to conduct a comprehensive investigation of state-of-the-art storage technologies available for big data. A well-defined taxonomy of big data storage technologies is presented to assist data analysts and researchers in understanding and selecting a storage mechanism that better fits their needs. To evaluate the performance of different storage architectures, we compare and analyze the existing approaches using Brewer's CAP theorem. The significance and applications of storage technologies and support to other categories are discussed. Several future research challenges are highlighted with the intention to expedite the deployment of a reliable and scalable storage system.
引用
收藏
页码:1040 / 1070
页数:31
相关论文
共 50 条
  • [41] Review of ultra-high density optical storage technologies for big data center
    Hao, Ruan
    Jie, Liu
    [J]. OPTICAL COMMUNICATION AND OPTICAL FIBER SENSORS AND OPTICAL MEMORIES FOR BIG DATA STORAGE, 2016, 10158
  • [42] Big data storage technologies: a case study for web-based LiDAR visualization
    Deibe, David
    Amor, Margarita
    Doallo, Ramon
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3831 - 3840
  • [44] The 51 V's Of Big Data Survey, Technologies, Characteristics, Opportunities, Issues and Challenges
    Khan, Nawsher
    Naim, Arshi
    Hussain, Mohammad Rashid
    Naveed, Quadri Noorulhasan
    Ahmad, Naim
    Qamar, Shamimul
    [J]. INTERNATIONAL CONFERENCE ON OMNI-LAYER INTELLIGENT SYSTEMS (COINS), 2019, : 19 - 24
  • [45] On the Research of Big Data Storage
    Qin, H. F.
    Qian, Z. M.
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY AND MANAGEMENT SCIENCE (ITMS 2015), 2015, 34 : 1410 - 1413
  • [46] Cognitive Storage for Big Data
    Cherubini, Giovanni
    Jelitto, Jens
    Venkatesan, Vinodh
    [J]. COMPUTER, 2016, 49 (04) : 43 - 51
  • [47] Editorial: Big data technologies and applications
    Yulei Wu
    Yi Pan
    Payam Barnaghi
    Zhiyuan Tan
    Jingguo Ge
    Hao Wang
    [J]. Wireless Networks, 2022, 28 : 1163 - 1167
  • [48] Emerging technologies in the age of big data
    Kricka, L. J.
    [J]. CLINICA CHIMICA ACTA, 2019, 493 : S755 - S755
  • [49] Editorial: Big data technologies and applications
    Wu, Yulei
    Pan, Yi
    Barnaghi, Payam
    Tan, Zhiyuan
    Ge, Jingguo
    Wang, Hao
    [J]. WIRELESS NETWORKS, 2022, 28 (03) : 1163 - 1167