Consistent discovery of frequent interval-based temporal patterns in chronic patients' data

被引:24
|
作者
Shknevsky, Alexander [1 ]
Shahar, Yuval [1 ]
Moskovitch, Robert [1 ]
机构
[1] Ben Gurion Univ Negev, Software & Informat Syst Engn, Beer Sheva, Israel
关键词
Temporal data mining; Temporal knowledge discovery; Temporal abstraction; Time intervals mining; Frequent pattern mining; Pattern repetition; Pattern consistency; Clustering; Classification; Prediction; ORIENTED CLINICAL-DATA; ABSTRACTION; INTELLIGENT; EXPLORATION; DEPENDENCIES; SYSTEM; RULES; CARE;
D O I
10.1016/j.jbi.2017.10.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Increasingly, frequent temporal patterns discovered in longitudinal patient records are proposed as features for classification and prediction, and as means to cluster patient clinical trajectories. However, to justify that, we must demonstrate that most frequent temporal patterns are indeed consistently discoverable within the records of different patient subsets within similar patient populations. We have developed several measures for the consistency of the discovery of temporal patterns. We focus on time-interval relations patterns (TIRPs) that can be discovered within different subsets of the same patient population. We expect the discovered TIRPs (1) to be frequent in each subset, (2) preserve their "local" metrics - the absolute frequency of each pattern, measured by a Proportion Test, and (3) preserve their "global" characteristics - their overall distribution, measured by a Kolmogorov-Smirnov test. We also wanted to examine the effect on consistency, over a variety of settings, of varying the minimal frequency threshold for TIRP discovery, and of using a TIRP-filtering criterion that we previously introduced, the Semantic Adjacency Criterion (SAC). We applied our methodology to three medical domains (oncology, infectious hepatitis, and diabetes). We found that, within the minimal frequency ranges we had examined, 70-95% of the discovered TIRPs were consistently discoverable; 40-48% of them maintained their local frequency. TIRP global distribution similarity varied widely, from 0% to 65%. Increasing the threshold usually increased the percentage of TIRPs that were repeatedly discovered across different patient subsets within the same domain, and the probability of a similar TIRP distribution. Using the SAC principle, enhanced, for most minimal support levels, the percentage of repeating TIRPs, their local consistency and their global consistency. The effect of using the SAC was further strengthened as the minimal frequency threshold was raised.
引用
收藏
页码:83 / 95
页数:13
相关论文
共 50 条
  • [1] Mining Temporal Patterns in Interval-Based Data
    Chen, Yi-Cheng
    Peng, Wen-Chih
    Lee, Suh-Yin
    [J]. 2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1506 - 1507
  • [2] Mining Temporal Patterns in Time Interval-Based Data
    Chen, Yi-Cheng
    Peng, Wen-Chih
    Lee, Suh-Yin
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (12) : 3318 - 3331
  • [3] Mining Frequent Time Interval-based Event with Duration Patterns from Temporal Database
    Chen, Kuan-Ying
    Jaysawal, Bijay Prasad
    Huang, Jen-Wei
    Wu, Yong-Bin
    [J]. 2014 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2014, : 548 - 554
  • [4] Discovering temporal patterns for interval-based events
    Kam, PS
    Fu, AWC
    [J]. DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2000, 1874 : 317 - 326
  • [5] A novel algorithm for mining closed temporal patterns from interval-based data
    Yi-Cheng Chen
    Julia Tzu-Ya Weng
    Lin Hui
    [J]. Knowledge and Information Systems, 2016, 46 : 151 - 183
  • [6] Mining High-utility Temporal Patterns on Time Interval-based Data
    Wang, Jun-Zhe
    Chen, Yi-Cheng
    Shih, Wen-Yueh
    Yang, Lin
    Liu, Yu-Shao
    Huang, Jiun-Long
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2020, 11 (04)
  • [7] A novel algorithm for mining closed temporal patterns from interval-based data
    Chen, Yi-Cheng
    Weng, Julia Tzu-Ya
    Hui, Lin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 46 (01) : 151 - 183
  • [8] Incrementally Mining Temporal Patterns in Interval-based Databases
    Chen, Yi-Cheng
    Weng, Julia Tzu-Ya
    Wang, Jun-Zhe
    Chou, Chien-Li
    Huang, Jiun-Long
    Lee, Suh-Yin
    [J]. 2014 INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2014, : 304 - 311
  • [9] Mining nonambiguous temporal patterns for interval-based events
    Wu, Shin-Yi
    Chen, Yen-Liang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2007, 19 (06) : 742 - 758
  • [10] Incremental mining of temporal patterns in interval-based database
    Hui, Lin
    Chen, Yi-Cheng
    Weng, Julia Tzu-Ya
    Lee, Suh-Yin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2016, 46 (02) : 423 - 448