β Algorithm: A New Probabilistic Process Learning Approach for Big Data in Healthcare

被引:8
|
作者
Zayoud, Maha [1 ]
Kotb, Yehia [1 ]
Ionescu, Sorin [2 ]
机构
[1] Amer Univ Middle East, Coll Engn & Technol, Al Uqiylah, Kuwait
[2] Univ Politehn Bucuresti, Ind Engn Dept, Bucharest, Romania
关键词
Big data; alpha algorithm; event logs; process mining; healthcare; PETRI NETS;
D O I
10.1109/ACCESS.2019.2922635
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new process learning framework that is based on probabilistic learning and predicate logic is proposed. The input of this framework is a set of log files, and the output is a probabilistic predicate-based workflow that describes the process. This paper targets a methodology of learning processes given data and the learning algorithm finds out the logical operators that bind the events described in data and model it using predicate logic. While building the process, the probability of every event and the probabilities of the relationship between events are calculated. The learning process is an ongoing process, which means after learning when feeding the system with a new set of log files, the algorithm takes the previously learned process and it is set of probabilities as a starting state and starts modifying them based on the newly learned log files. This feature is very essential for those applications that integrate and interact with bigdata since for bigdata, starting the learning process from the beginning for every new set of data is not feasible. In this paper, the assumption is that log files are event-based, and every event is associated with its time of occurrence. Any event could have multiple occurrence times throughout the log files. The framework provides an optimal general definition of a process that is described by those log files. The process could change schematically or with respect to behavior when learning a new set of logs. In order to achieve what is described, a dependency matrix needs to be learned, and then the probability matrix is calculated. The outcome of the two matrices is a predicate-based workflow. Workflows can easily be described by Petri nets and Petri nets can map to predicate logic. The reason to convert the workflow into the knowledge base is the ability to infer new facts from given facts we conclude from log files. In this paper, we integrate a modification to alpha algorithm with the framework in order to describe dependencies and probability of occurrences of events.
引用
收藏
页码:78842 / 78869
页数:28
相关论文
共 50 条
  • [1] A new process for healthcare big data warehouse integration
    Arfaoui, Nouha
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2023, 15 (03) : 240 - 254
  • [2] Federated Learning Approach to Protect Healthcare Data over Big Data Scenario
    Dhiman, Gaurav
    Juneja, Sapna
    Mohafez, Hamidreza
    El-Bayoumy, Ibrahim
    Sharma, Lokesh Kumar
    Hadizadeh, Maryam
    Islam, Mohammad Aminul
    Viriyasitavat, Wattana
    Khandaker, Mayeen Uddin
    [J]. SUSTAINABILITY, 2022, 14 (05)
  • [3] Distributed algorithm for big data analytics in healthcare
    Forestiero, Agostino
    Papuzzo, Giuseppe
    [J]. 2018 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2018), 2018, : 776 - 779
  • [4] A New Lexicon Learning Algorithm for Sentiment Analysis of Big Data
    Keshavarz, Hamidreza
    Abadeh, Mohammad Saniee
    Almasi, Mehrdad
    [J]. 2017 IEEE 15TH INTERNATIONAL SYMPOSIUM ON INTELLIGENT SYSTEMS AND INFORMATICS (SISY), 2017, : 249 - 253
  • [5] A probabilistic algorithm to process geolocation data
    Benjamin Merkel
    Richard A. Phillips
    Sébastien Descamps
    Nigel G. Yoccoz
    Børge Moe
    Hallvard Strøm
    [J]. Movement Ecology, 4
  • [6] A probabilistic algorithm to process geolocation data
    Merkel, Benjamin
    Phillips, Richard A.
    Descamps, Sebastien
    Yoccoz, Nigel G.
    Moe, Borge
    Strom, Hallvard
    [J]. MOVEMENT ECOLOGY, 2016, 4
  • [7] Big Data and Machine Learning Framework in Healthcare
    Dogaru, Delia Ioana
    Dumitrache, Ioan
    [J]. 2019 E-HEALTH AND BIOENGINEERING CONFERENCE (EHB), 2019,
  • [8] A Probabilistic Approach for Threaded Process Learning
    Zayoud, Maha
    Oueida, Soraia
    Kotb, Yehia
    AbiChar, Pierre
    [J]. 2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [9] Big data Analytics in Healthcare: A Survey Approach
    Ramesh, Dharavath
    Suraj, Pranshu
    Saini, Lokendra
    [J]. 2016 INTERNATIONAL CONFERENCE ON MICROELECTRONICS, COMPUTING AND COMMUNICATIONS (MICROCOM), 2016,
  • [10] Putting the data before the algorithm in big data addressing personalized healthcare
    Cahan, Eli M.
    Hernandez-Boussard, Tina
    Thadaney-Israni, Sonoo
    Rubin, Daniel L.
    [J]. NPJ DIGITAL MEDICINE, 2019, 2 (1)