On the provenance extraction techniques from large scale log files

Cited by: 2
Authors
Tufek, Alper [1]
Aktas, Mehmet S. [1]
Affiliations
[1] Yildiz Tech Univ, Comp Engn Dept, Istanbul, Turkey
Keywords
machine learning-based provenance extraction; numerical weather prediction models; provenance; provenance analysis
DOI
10.1002/cpe.6559
Chinese Library Classification (CLC) number
TP31 [Computer software]
Subject classification codes
081202; 0835
Abstract
Numerical weather prediction (NWP) models are the primary instruments for predicting future weather. Provenance information is of central importance for detecting unexpected events that may develop during the long course of model execution. In addition, the need to share scientific data and results between researchers highlights the importance of data quality and reliability. The Weather Research and Forecasting (WRF) model is an open-source NWP model. In this study, we propose a methodology for tracking the WRF model and for generating, storing, and analyzing provenance. We implement the proposed methodology with a machine learning-based parser, which uses classification algorithms to extract provenance information from log files. The proposed approach makes numerical weather forecast workflows easier to manage and understand by providing provenance graphs. By analyzing these graphs, potentially faulty situations that may occur during the execution of WRF can be traced to their root causes. Our evaluation shows that the approach performs well even under a high-frequency flow of provenance information.
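As a rough illustration of the idea in the abstract (classify log lines, then turn the classified lines into provenance graph edges), the following is a minimal sketch and not the authors' implementation. Everything in it is an assumption made for illustration: the sample log lines, the labels `used` and `generated` (loosely borrowed from W3C PROV relation names), the activity name `wrf_run`, the choice of a small naive Bayes classifier, and the rule that the last token of a line names the file.

```python
import math
from collections import Counter, defaultdict

# Hypothetical training data: log lines labelled with a provenance role.
# These lines and labels are illustrative only, not real WRF log output.
TRAIN = [
    ("opening input file wrfinput_d01", "used"),
    ("reading input file wrfbdy_d01", "used"),
    ("opening input file namelist", "used"),
    ("writing output file wrfout_d01", "generated"),
    ("writing restart file wrfrst_d01", "generated"),
    ("writing output file wrfout_d02", "generated"),
    ("timing for main step elapsed seconds", "other"),
    ("timing for processing step elapsed seconds", "other"),
]


def tokenize(line):
    """Lowercase whitespace tokenizer."""
    return line.lower().split()


class NaiveBayes:
    """Tiny multinomial naive Bayes with add-one smoothing."""

    def fit(self, samples):
        self.class_counts = Counter()
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in samples:
            self.class_counts[label] += 1
            for tok in tokenize(text):
                self.word_counts[label][tok] += 1
                self.vocab.add(tok)
        return self

    def predict(self, text):
        total = sum(self.class_counts.values())
        best, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            # Log prior plus smoothed log likelihood of each token.
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for tok in tokenize(text):
                score += math.log((self.word_counts[label][tok] + 1) / denom)
            if score > best_score:
                best, best_score = label, score
        return best


def extract_provenance(lines, model, activity="wrf_run"):
    """Return (activity, relation, entity) edges for provenance-relevant lines.

    Assumes (hypothetically) that the last token of a line names the file.
    """
    edges = []
    for line in lines:
        label = model.predict(line)
        if label in ("used", "generated"):
            edges.append((activity, label, tokenize(line)[-1]))
    return edges


model = NaiveBayes().fit(TRAIN)
edges = extract_provenance(
    [
        "opening input file wrfinput_d02",
        "writing output file wrfout_d03",
        "timing for main step",
    ],
    model,
)
```

The resulting edge list can be loaded into any graph structure; lines classified as `other` are dropped because they carry no provenance relation in this toy labelling scheme.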
Pages: 12