Fault tolerance in big data storage and processing systems: A review on challenges and solutions

被引:14
|
作者
Saadoon, Muntadher [1 ]
Ab Hamid, Siti Hafizah [1 ]
Sofian, Hazrina [1 ]
Altarturi, Hamza H. M. [1 ]
Azizul, Zati Hakim [1 ]
Nasuha, Nur [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Software Engn, Kuala Lumpur 50603, Malaysia
关键词
Fault tolerance; Fault detection; Fault recovery; Big data storage; Big data processing; FAILURE RECOVERY; MAPREDUCE; RELIABILITY; AVAILABILITY; REPLICATION; NETWORKS; HADOOP;
D O I
10.1016/j.asej.2021.06.024
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Big data systems are sufficiently stable to store and process a massive volume of rapidly changing data. However, big data systems are composed of large-scale hardware resources that make their subspecies easily fail. Fault tolerance is the main property of such systems because it maintains availability, reliability, and constant performance during faults. Achieving an efficient fault tolerance solution in a big data system is challenging because fault tolerance must meet some constraints related to the system performance and resource consumption. This study aims to provide a consistent understanding of fault tolerance in big data systems and highlights common challenges that hinder the improvement in fault tolerance efficiency. The fault tolerance solutions applied by previous studies intended to address the identified challenges are reviewed. The paper also presents a perceptive discussion of the findings derived from previous studies and proposes a list of future directions to address the fault tolerance challenges. (C) 2021 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Big Data Security Challenges and Preventive Solutions
    Gupta, Nirmal Kumar
    Rohil, Mukesh Kumar
    DATA MANAGEMENT, ANALYTICS AND INNOVATION, ICDMAI 2019, VOL 1, 2020, 1042 : 285 - 299
  • [42] Challenges and Solutions in Big data management - An Overview
    Kanchi, Sravanthi
    Sandilya, Shubhrika
    Ramkrishna, Shashank
    Manjrekar, Siddhesh
    Vhadgar, Akshata
    2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : 418 - 426
  • [43] A multi-factor monitoring fault tolerance model based on a GPU cluster for big data processing
    Fang, Yuling
    Chen, Qingkui
    Xiong, Naixue
    INFORMATION SCIENCES, 2019, 496 : 300 - 316
  • [44] A review of OBN processing: challenges and solutions
    Zhang, Dongliang
    Tsingas, Constantinos
    Ghamdi, Ahmed A.
    Huang, Mingzhong
    Jeong, Woodon
    Sliz, Krzysztof K.
    Aldeghaither, Saud M.
    Zahrani, Saeed A.
    JOURNAL OF GEOPHYSICS AND ENGINEERING, 2021, 18 (04) : 492 - 502
  • [45] IoT-Based Big Data Storage Systems in Cloud Computing: Perspectives and Challenges
    Cai, Hongming
    Xu, Boyi
    Jiang, Lihong
    Vasilakos, Athanasios V.
    IEEE INTERNET OF THINGS JOURNAL, 2017, 4 (01): : 75 - 87
  • [46] REAPS: Quasi-active Fault Tolerance for Big Data Publish-Subscribe Systems
    Nguyen, Hang
    Uddin, M. Y. S.
    Venkatasubramanian, Nalini
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 368 - 375
  • [47] Low-overhead fault tolerance for high-throughput data processing systems
    Martin, Andre
    Knauth, Thomas
    Creutz, Stephan
    Becker, Diogo
    Weigert, Stefan
    Fetzer, Christof
    Brito, Andrey
    31ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2011), 2011, : 689 - 699
  • [48] Online Fault Detection and Fault Tolerance in Electrical Energy Storage Systems
    Wang, Yanzhi
    Lin, Xue
    Pedram, Massoud
    Chang, Naehyuck
    2014 IEEE PES GENERAL MEETING - CONFERENCE & EXPOSITION, 2014,
  • [49] Challenges and Benefits of Deploying Big Data Storage Solution
    Kachaoui, Jabrane
    Belangour, Abdessamad
    PROCEEDINGS OF THE SECOND CONFERENCE OF THE MOROCCAN CLASSIFICATION SOCIETY: NEW CHALLENGES IN DATA SCIENCES (SMC '2019), 2019, : 150 - 154
  • [50] High Performance and Fault Tolerant Distributed File System for Big Data Storage and Processing using Hadoop
    Sivaraman, E.
    Manickachezian, R.
    2014 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING APPLICATIONS (ICICA 2014), 2014, : 32 - 36