On the provenance extraction techniques from large scale log files

Cited by: 2
Authors
Tufek, Alper [1]
Aktas, Mehmet S. [1]
Affiliations
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
Source
Keywords
machine learning-based provenance extraction; numerical weather prediction models; provenance; provenance analysis
DOI
10.1002/cpe.6559
Chinese Library Classification (CLC) number
TP31 [Computer Software];
Discipline classification code
081202 ; 0835 ;
Abstract
Numerical weather prediction (NWP) models are the most important instruments for predicting future weather. Provenance information is central to detecting unexpected events that may develop during the long course of model execution. In addition, the need to share scientific data and results among researchers highlights the importance of data quality and reliability. The Weather Research and Forecasting (WRF) model is an open-source NWP model. In this study, we propose a methodology for tracking the WRF model and for generating, storing, and analyzing provenance. We implement the proposed methodology with a machine learning-based parser that uses classification algorithms to extract provenance information. The proposed approach makes numerical weather forecast workflows easier to manage and understand by providing provenance graphs. By analyzing these graphs, faulty situations that may occur during the execution of WRF can be traced to their root causes. The proposed approach has been evaluated and shown to perform well even under a high-frequency flow of provenance information.
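
The idea of a classification-based parser can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the classification algorithms used, so a small multinomial Naive Bayes classifier stands in here. The log lines, the labels ("used", "generated", "other", loosely modeled on provenance relations), and the names TRAIN, tokenize, and NaiveBayes are all hypothetical and do not reproduce real WRF log output.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical training data: (log line, provenance label).
# These lines are illustrative only and are NOT real WRF log output.
TRAIN = [
    ("opening input file wrfinput_d01", "used"),
    ("reading input file wrfbdy_d01", "used"),
    ("writing output file wrfout_d01", "generated"),
    ("creating output file wrfrst_d01", "generated"),
    ("timing for main step elapsed seconds", "other"),
    ("timing for processing step elapsed seconds", "other"),
]


def tokenize(line):
    """Split a log line into lowercase alphabetic tokens (digits are dropped)."""
    return re.findall(r"[a-z]+", line.lower())


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, samples):
        self.word_counts = defaultdict(Counter)  # label -> token frequencies
        self.class_counts = Counter()            # label -> number of lines
        self.vocab = set()
        for line, label in samples:
            tokens = tokenize(line)
            self.class_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, line):
        total = sum(self.class_counts.values())
        tokens = tokenize(line)
        best_label, best_score = None, -math.inf
        for label, count in self.class_counts.items():
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(count / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for tok in tokens:
                score += math.log((self.word_counts[label][tok] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


if __name__ == "__main__":
    model = NaiveBayes().fit(TRAIN)
    for line in ("opening input file wrfinput_d02", "timing for main step"):
        print(line, "->", model.predict(line))
```

In a pipeline of this kind, each classified line could then become a node or edge in a provenance graph; that step is omitted here because the abstract does not describe the graph schema.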
Number of pages: 12
Related papers
50 items in total
  • [1] On the Provenance Extraction Techniques from Large Scale Log Files: A Case Study for the Numerical Weather Prediction Models
    Tufek, Alper
    Aktas, Mehmet S.
    EURO-PAR 2020: PARALLEL PROCESSING WORKSHOPS, 2021, 12480 : 249 - 260
  • [2] Terminology Extraction from Log Files
    Saneifar, Hassan
    Bonniol, Stephane
    Laurent, Anne
    Poncelet, Pascal
    Roche, Mathieu
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 769 - +
  • [3] An Extensive Comparison of Systems for Entity Extraction from Log Files
    Chhabra, Anubhav
    Branco, Paula
    Jourdan, Guy-Vincent
    Viktor, Herna L.
    FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2021, 2022, 13291 : 376 - 392
  • [4] Data Mining Algorithms for Knowledge Extraction from Web Log Files
    El Alami, Anass Abdelhamid
    Ezzikouri, Hanane
    Erritali, Mohammed
    ADVANCED INTELLIGENT SYSTEMS FOR SUSTAINABLE DEVELOPMENT (AI2SD'2019): VOL 1 - ADVANCED INTELLIGENT SYSTEMS FOR EDUCATION AND INTELLIGENT LEARNING SYSTEM, 2020, 1102 : 118 - 128
  • [5] From Terminology Extraction to Terminology Validation: An Approach Adapted to Log Files
    Saneifar, Hassan
    Bonniol, Stephane
    Poncelet, Pascal
    Roche, Mathieu
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2015, 21 (04) : 604 - 635
  • [6] An Effective Approach for Parsing Large Log Files
    Sedki, Issam
    Hamou-Lhadj, Abdelwahab
    Ait-Mohamed, Otmane
    Shehab, Mohammed A.
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 1 - 12
  • [7] Extraction of information from log files Using Python Programming and Tableau
    Rigueira, Filipe
    Bernardino, Jorge
    Pedrosa, Isabel
    2020 15TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI'2020), 2020,
  • [8] The Research of Preprocessing and Pattern Discovery Techniques on Web Log files
    Dhanalakshmi, P.
    Ramani, K.
    Reddy, B. Eswara
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING (IACC), 2016, : 139 - 145
  • [9] Analysis of log files applying mining techniques and fuzzy logic
    Escobar-Jeria, Victor H.
    Martin-Bautista, Maria J.
    Sanchez, Daniel
    Vila, Maria-Amparo
    NEW TRENDS IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4570 : 483 - +
  • [10] Anomaly Detection in Log Files Based on Machine Learning Techniques
    Hussein, Salam Allawi
    Repas, Sandor R.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (03) : 1299 - 1311