Comparison of similarity measures and clustering methods for time-series medical data mining

被引:1
|
作者
Hirano, S [1 ]
Tsumoto, S [1 ]
机构
[1] Shimane Med Univ, Sch Med, Dept Med Informat, Izumo, Shimane 6938501, Japan
关键词
multiscale matching; temporal data mining; proximity measure;
D O I
10.1117/12.487508
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper reports characteristics of dissimilarity measures used in the multiscale matching. Multiscale matching is a method for comparing two planar curves by partially changing observation scales. Throughout all scales, it finds the best set of pairs of partial contours that contains no miss-matched or over-matched contours and that minimizes the accumulated differences between the partial contours. In order to make this method applicable to comparison of the temporal sequences, we have proposed a dissimilarity measure that compares subsequences according to the following aspects: rotation angle, length, phase and gradient. However, it empirically became apparent that it was difficult to understand from the results that which aspects were really contributed to the resultant dissimilarity of the sequences. In order to investigate fundamental characteristics of the dissimilarity measure, we performed quantitative analysis of the induced dissimilarities using simple sine wave and its variants. The results showed that differences on the amplitude, phase and trends were respectively captured by the terms on rotation angle, phase and gradient, although they also showed weakness on the linearity.
引用
收藏
页码:219 / 225
页数:7
相关论文
共 50 条
  • [1] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [2] Data mining of time-series medical data by formal concept analysis
    Sato, Kenji
    Okubo, Yoshiaki
    Haraguchi, Iakoto
    Kunifuji, Susumu
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS: KES 2007 - WIRN 2007, PT II, PROCEEDINGS, 2007, 4693 : 1214 - 1221
  • [3] Empirical comparison of clustering methods for long time-series databases
    Hirano, S
    Tsumoto, S
    ACTIVE MINING, 2005, 3430 : 268 - 286
  • [4] A comparative study of clustering methods for long time-series medical databases
    Tsumoto, S
    Hirano, S
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3131 : 260 - 272
  • [5] Clustering of multivariate time-series data
    Singhal, A
    Seborg, DE
    PROCEEDINGS OF THE 2002 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2002, 1-6 : 3931 - 3936
  • [6] Clustering multivariate time-series data
    Singhal, A
    Seborg, DE
    JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 427 - 438
  • [7] Fuzzy data mining for time-series data
    Chen, Chun-Hao
    Hong, Tzung-Pei
    Tseng, Vincent S.
    APPLIED SOFT COMPUTING, 2012, 12 (01) : 536 - 542
  • [8] Alignment versus Variation Methods for Clustering Microarray Time-Series Data
    Subhani, Numanul
    Li, Yifeng
    Ngom, Alioune
    Rueda, Luis
    2010 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2010,
  • [9] Learning time-series similarity with a neural network by combining similarity measures
    Sagrebin, Maria
    Goerke, Nils
    ARTIFICIAL NEURAL NETWORKS - ICANN 2006, PT 2, 2006, 4132 : 123 - 132
  • [10] Structural comparison and cluster analysis of time-series medical data
    Hirano, S
    Tsumoto, S
    INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOL 1-4, PROCEEDINGS, 2005, : 1506 - 1511