Assessing Big Data SQL Frameworks for Analyzing Event Logs

被引:5
|
作者
Hinkka, Markku [1 ,2 ]
Lehto, Teemu [1 ,2 ]
Heljanko, Keijo [1 ,3 ]
机构
[1] Aalto Univ, Sch Sci, Dept Comp Sci, Aalto, Finland
[2] QPR Software Plc, Helsinki, Finland
[3] Aalto Univ, Helsinki, Finland
关键词
MAPREDUCE;
D O I
10.1109/PDP.2016.26
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Performing Process Mining by analyzing event logs generated by various systems is a very computation and I/O intensive task. Distributed computing and Big Data processing frameworks make it possible to distribute all kinds of computation tasks to multiple computers instead of performing the whole task in a single computer. This paper assesses whether contemporary structured query language (SQL) supporting Big Data processing frameworks are mature enough to be efficiently used to distribute computation of two central Process Mining tasks to two dissimilar clusters of computers providing BPM as a service in the cloud. Tests are performed by using a novel automatic testing framework detailed in this paper and its supporting materials. As a result, an assessment is made on how well selected Big Data processing frameworks manage to process and to parallelize the analysis work required by Process Mining tasks.
引用
收藏
页码:101 / 108
页数:8
相关论文
共 50 条
  • [1] A Reference Data Model to Specify Event Logs for Big Data Pipeline Discovery
    Benvenuti, Dario
    Marrella, Andrea
    Rossi, Jacopo
    Nikolov, Nikolay
    Roman, Dumitru
    Soylu, Ahmet
    Perales, Fernando
    [J]. BUSINESS PROCESS MANAGEMENT FORUM, BPM 2023 FORUM, 2023, 490 : 38 - 54
  • [2] Analyzing SQL payloads using logistic regression in a big data environment
    Shareef, Omar Salah F.
    Hasan, Rehab Flaih
    Farhan, Ammar Hatem
    [J]. JOURNAL OF INTELLIGENT SYSTEMS, 2023, 32 (01)
  • [3] Crossing an OCEAN of Queries: Analyzing SQL Query Logs with OCEANLog
    Wahl, Andreas M.
    Endler, Gregor
    Schwab, Peter K.
    Herbst, Sebastian
    Rith, Julian
    Lenz, Richard
    [J]. 30TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT (SSDBM 2018), 2018,
  • [4] A Graph-Based Framework for Analyzing SQL Query Logs
    Wahl, Andreas M.
    Endler, Gregor
    Schwab, Peter K.
    Rith, Julian
    Herbst, Sebastian
    Lenz, Richard
    [J]. GRADES-NDA '18: PROCEEDINGS OF THE 1ST ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS (GRADES) AND NETWORK DATA ANALYTICS (NDA) 2018 (GRADES-NDA 2018), 2018,
  • [5] Discovering and Analyzing Contextual Behavioral Patterns From Event Logs
    Acheli, Mehdi
    Grigori, Daniela
    Weidlich, Matthias
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2022, 34 (12) : 5708 - 5721
  • [6] Big Data Analytics Frameworks
    Chandarana, Parth
    Vijayalakshmi, M.
    [J]. 2014 INTERNATIONAL CONFERENCE ON CIRCUITS, SYSTEMS, COMMUNICATION AND INFORMATION TECHNOLOGY APPLICATIONS (CSCITA), 2014, : 430 - 434
  • [7] A Big Data analyzer for large trace logs
    Balliu, Alkida
    Olivetti, Dennis
    Babaoglu, Ozalp
    Marzolla, Moreno
    Sirbu, Alina
    [J]. COMPUTING, 2016, 98 (12) : 1225 - 1249
  • [8] Big Data Management the Mass Weather Logs
    Wu, Hao
    [J]. SMART COMPUTING AND COMMUNICATION, SMARTCOM 2016, 2017, 10135 : 122 - 132
  • [9] A Big Data analyzer for large trace logs
    Alkida Balliu
    Dennis Olivetti
    Ozalp Babaoglu
    Moreno Marzolla
    Alina Sîrbu
    [J]. Computing, 2016, 98 : 1225 - 1249
  • [10] Metric for Analyzing Big Data
    Hahanova, Yulia
    Yemelyanov, Igor
    Hahanova, Anna
    Obrizan, Volodymyr
    Krulevska, Daria
    Skorobogatiy, Mikhail
    [J]. PROCEEDINGS OF XIIITH INTERNATIONAL CONFERENCE - EXPERIENCE OF DESIGNING AND APPLICATION OF CAD SYSTEMS IN MICROELECTRONICS CADSM 2015, 2015, : 81 - 83