Cluster Sequence Mining: Causal Inference with Time and Space Proximity Under Uncertainty

被引:1
|
作者
Okada, Yoshiyuki [1 ]
Fukui, Ken-ichi [1 ]
Moriyama, Koichi [1 ]
Numao, Masayuki [1 ]
机构
[1] Osaka Univ, Inst Sci & Ind Res, Ibaraki, Japan
关键词
Hierarchical clustering; Pattern mining; Bayesian learning; Earthquake;
D O I
10.1007/978-3-319-18032-8_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a pattern mining algorithm for numerical multidimensional event sequences, called cluster sequence mining (CSM). CSM extracts patterns with a pair of clusters that satisfies space proximity of the individual clusters and time proximity in time intervals between events from different clusters. CSM is an extension of a unique algorithm (co-occurrence cluster mining (CCM)), considering the order of events and the distribution of time intervals. The probability density of the time intervals is inferred by utilizing Bayesian inference for robustness against uncertainty. In an experiment using synthetic data, we confirmed that CSM is capable of extracting clusters with high F-measure and low estimation error of the time interval distribution even under uncertainty. CSM was applied to an earthquake event sequence in Japan after the 2011 Tohoku Earthquake to infer causality of earthquake occurrences. The results demonstrate that CSM suggests some high affecting/affected areas in the subduction zone farther away from the main shock of the Tohoku Earthquake.
引用
收藏
页码:293 / 304
页数:12
相关论文
共 32 条