Processing Big Data with Apache Hadoop in the Current Challenging Era of COVID-19

被引:15
|
作者
Azeroual, Otmane [1 ]
Fabre, Renaud [2 ]
机构
[1] German Ctr Higher Educ Res & Sci Studies DZHW, D-10117 Berlin, Germany
[2] Univ Paris 08, Dionysian Econ Lab LED, F-93200 St Denis, France
关键词
big data; data processing; unstructured data; large amounts of data; COVID-19; challenges; Hadoop technology; MapReduce; WordCount; ANALYTICS;
D O I
10.3390/bdcc5010012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data have become a global strategic issue, as increasingly large amounts of unstructured data challenge the IT infrastructure of global organizations and threaten their capacity for strategic forecasting. As experienced in former massive information issues, big data technologies, such as Hadoop, should efficiently tackle the incoming large amounts of data and provide organizations with relevant processed information that was formerly neither visible nor manageable. After having briefly recalled the strategic advantages of big data solutions in the introductory remarks, in the first part of this paper, we focus on the advantages of big data solutions in the currently difficult time of the COVID-19 pandemic. We characterize it as an endemic heterogeneous data context; we then outline the advantages of technologies such as Hadoop and its IT suitability in this context. In the second part, we identify two specific advantages of Hadoop solutions, globality combined with flexibility, and we notice that they are at work with a "Hadoop Fusion Approach" that we describe as an optimal response to the context. In the third part, we justify selected qualifications of globality and flexibility by the fact that Hadoop solutions enable comparable returns in opposite contexts of models of partial submodels and of models of final exact systems. In part four, we remark that in both these opposite contexts, Hadoop's solutions allow a large range of needs to be fulfilled, which fits with requirements previously identified as the current heterogeneous data structure of COVID-19 information. In the final part, we propose a framework of strategic data processing conditions. To the best of our knowledge, they appear to be the most suitable to overcome COVID-19 massive information challenges.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Towards Efficient Big Data: Hadoop Data Placing and Processing
    Bahadi, Jihane
    El Asri, Bouchra
    Courtine, Melanie
    Rhanoui, Maryem
    Kergosien, Yannick
    2ND INTERNATIONAL CONFERENCE ON SMART DIGITAL ENVIRONMENT (ICSDE'18), 2018, : 42 - 47
  • [32] Big Data Visualization and Visual Analytics of COVID-19 Data
    Leung, Carson K.
    Chen, Yubo
    Hoi, Calvin S. H.
    Shang, Siyuan
    Wen, Yan
    Cuzzocrea, Alfredo
    2020 24TH INTERNATIONAL CONFERENCE INFORMATION VISUALISATION (IV 2020), 2020, : 415 - 420
  • [33] Hadoop Paradigm for Satellite Environmental Big Data Processing
    Semlali, Badr-Eddine Boudriki
    El Amrani, Chaker
    Ortiz, Guadalupe
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 23 - 47
  • [34] Big medical data processing system based on hadoop
    Liu, W.
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 181 - 181
  • [35] Apache Spark: A Unified Engine for Big Data Processing
    Zaharia, Matei
    Xin, Reynold S.
    Wendell, Patrick
    Das, Tathagata
    Armbrust, Michael
    Dave, Ankur
    Meng, Xiangrui
    Rosen, Josh
    Venkataraman, Shivaram
    Franklin, Michael J.
    Ghodsi, Ali
    Gonzalez, Joseph
    Shenker, Scott
    Stoica, Ion
    COMMUNICATIONS OF THE ACM, 2016, 59 (11) : 56 - 65
  • [36] Testing big data in a big crisis: Nowcasting under Covid-19
    Barbaglia, Luca
    Frattarolo, Lorenzo
    Onorante, Luca
    Pericoli, Filippo Maria
    Ratto, Marco
    Pezzoli, Luca Tiozzo
    INTERNATIONAL JOURNAL OF FORECASTING, 2023, 39 (04) : 1548 - 1563
  • [37] Big data in COVID-19 surveillance system: A commentary
    Jahanbakhsh, Maryam
    Ashrafi-Rizi, Hasan
    Jangi, Majid
    Sattari, Mohammad
    JOURNAL OF EDUCATION AND HEALTH PROMOTION, 2020, 9 (01)
  • [38] Machine Learning and OLAP on Big COVID-19 Data
    Leung, Carson K.
    Chen, Yubo
    Hoi, Calvin S. H.
    Shang, Siyuan
    Cuzzocrea, Alfredo
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 5118 - 5127
  • [39] Big data assimilation to improve the predictability of COVID-19
    Li, Xin
    Zhao, Zebin
    Liu, Feng
    GEOGRAPHY AND SUSTAINABILITY, 2020, 1 (04) : 317 - 320
  • [40] Using 'big data' to disentangle aging and COVID-19
    Montgomery, Ruth R. R.
    Steen, Hanno
    NATURE AGING, 2021, 1 (06): : 496 - 497