Processing Big Data with Apache Hadoop in the Current Challenging Era of COVID-19

被引:15
|
作者
Azeroual, Otmane [1 ]
Fabre, Renaud [2 ]
机构
[1] German Ctr Higher Educ Res & Sci Studies DZHW, D-10117 Berlin, Germany
[2] Univ Paris 08, Dionysian Econ Lab LED, F-93200 St Denis, France
关键词
big data; data processing; unstructured data; large amounts of data; COVID-19; challenges; Hadoop technology; MapReduce; WordCount; ANALYTICS;
D O I
10.3390/bdcc5010012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data have become a global strategic issue, as increasingly large amounts of unstructured data challenge the IT infrastructure of global organizations and threaten their capacity for strategic forecasting. As experienced in former massive information issues, big data technologies, such as Hadoop, should efficiently tackle the incoming large amounts of data and provide organizations with relevant processed information that was formerly neither visible nor manageable. After having briefly recalled the strategic advantages of big data solutions in the introductory remarks, in the first part of this paper, we focus on the advantages of big data solutions in the currently difficult time of the COVID-19 pandemic. We characterize it as an endemic heterogeneous data context; we then outline the advantages of technologies such as Hadoop and its IT suitability in this context. In the second part, we identify two specific advantages of Hadoop solutions, globality combined with flexibility, and we notice that they are at work with a "Hadoop Fusion Approach" that we describe as an optimal response to the context. In the third part, we justify selected qualifications of globality and flexibility by the fact that Hadoop solutions enable comparable returns in opposite contexts of models of partial submodels and of models of final exact systems. In part four, we remark that in both these opposite contexts, Hadoop's solutions allow a large range of needs to be fulfilled, which fits with requirements previously identified as the current heterogeneous data structure of COVID-19 information. In the final part, we propose a framework of strategic data processing conditions. To the best of our knowledge, they appear to be the most suitable to overcome COVID-19 massive information challenges.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Big data for human security: The case of COVID-19
    Cardenas, Pedro
    Ivrissimtzis, Ioannis
    Obara, Boguslaw
    Kureshi, Ibad
    Theodoropoulos, Georgios
    JOURNAL OF COMPUTATIONAL SCIENCE, 2022, 60
  • [42] The Role of Big Data and Machine Learning in COVID-19
    Ababneh, Mustafa
    Aljarrah, Aayat
    Karagozlu, Damla
    BRAIN-BROAD RESEARCH IN ARTIFICIAL INTELLIGENCE AND NEUROSCIENCE, 2020, 11 (02) : 1 - 20
  • [43] Significant Applications of Big Data in COVID-19 Pandemic
    Haleem, Abid
    Javaid, Mohd.
    Khan, Ibrahim Haleem
    Vaishya, Raju
    INDIAN JOURNAL OF ORTHOPAEDICS, 2020, 54 (04) : 526 - 528
  • [44] Significant Applications of Big Data in COVID-19 Pandemic
    Abid Haleem
    Mohd. Javaid
    Ibrahim Haleem Khan
    Raju Vaishya
    Indian Journal of Orthopaedics, 2020, 54 : 526 - 528
  • [45] Using ‘big data’ to disentangle aging and COVID-19
    Ruth R. Montgomery
    Hanno Steen
    Nature Aging, 2021, 1 : 496 - 497
  • [46] Mobile Big Data in the fight against COVID-19
    Benjamins, Richard
    Vos, Jeanine
    Verhulst, Stefaan
    DATA & POLICY, 2022, 4
  • [47] Combat COVID-19 with artificial intelligence and big data
    Lin, Leesa
    Hou, Zhiyuan
    JOURNAL OF TRAVEL MEDICINE, 2020, 27 (05)
  • [48] THINKING OUT OF BOX: A CHALLENGING MANIFESTATION OF PHEOCHROMOCYTOMA CRISIS IN COVID-19 ERA
    Amprai, Monpraween
    Promratpan, Wasinee
    Srimahachota, Suphot
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2022, 79 (09) : 2713 - 2713
  • [49] COVID-19 Pandemic in the New Era of Big Data Analytics: Methodological Innovations and Future Research Directions
    Sheng, Jie
    Amankwah-Amoah, Joseph
    Khan, Zaheer
    Wang, Xiaojun
    BRITISH JOURNAL OF MANAGEMENT, 2021, 32 (04) : 1164 - 1183
  • [50] PERFORMANCE COMPARISON OF APACHE SPARK AND HADOOP FOR MACHINE LEARNING BASED ITERATIVE GBTR ON HIGGS AND COVID-19 DATASETS
    Sewal, Piyush
    Singh, Hari
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (03): : 1373 - 1386