A Geometric View of Similarity Measures in Data Mining

Cited by: 5
Authors
Darvishi, A. [1 ]
Hassanpour, H. [1 ]
Affiliation
[1] Univ Shahrood, Fac Comp Engn, Shahrood, Iran
Source
INTERNATIONAL JOURNAL OF ENGINEERING | 2015 / Vol. 28 / No. 12
Keywords
Data Mining; Feature Extraction; Similarity Measures; Geometric View;
DOI
10.5829/idosi.ije.2015.28.12c.05
Chinese Library Classification
T [Industrial Technology];
Discipline classification code
08 ;
Abstract
The main objective of data mining is to acquire information from a set of data for prospective applications using a measure. A central concern is that one often has to deal with large-scale data. Several dimensionality reduction techniques, such as various feature extraction methods, have been developed to address this issue. However, the geometric view of the applied measure, as an additional consideration, is generally neglected. Since each measure has its own perspective on the data, different interpretations may be reached depending on the measure used. While efforts are often focused on adjusting the feature extraction techniques for mining the data, choosing a suitable measure with regard to the nature or general characteristics of the data or application is more appropriate. Given a pair of sequences, one measure may consider them similar while another may quantify them as dissimilar. The goal of this research is twofold: demonstrating the role of feature extraction in data mining and revealing the significance of the geometric attributes of similarity measures in detecting relationships between data. Different similarity measures are also applied to three synthetic datasets and a real set of ECG time series to examine their performance.
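The abstract's point that two measures can disagree about the same pair of sequences can be illustrated with a minimal sketch. The choice of Euclidean distance and cosine similarity here is an assumption made for illustration, as are the toy sequences `base`, `scaled`, and `nearby`; the record does not state which measures or data the paper actually uses.

```python
import math

def euclidean_distance(x, y):
    """Straight-line distance: sensitive to magnitude (smaller means more similar)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def cosine_similarity(x, y):
    """Cosine of the angle between x and y: ignores magnitude (larger means more similar)."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)

# Hypothetical toy sequences, not taken from the paper.
base = [1.0, 2.0, 3.0, 4.0]
scaled = [10.0, 20.0, 30.0, 40.0]  # same shape as base, ten times the amplitude
nearby = [4.0, 3.0, 2.0, 1.0]      # similar amplitude, reversed trend

for name, other in (("scaled", scaled), ("nearby", nearby)):
    print(name,
          "euclidean:", round(euclidean_distance(base, other), 3),
          "cosine:", round(cosine_similarity(base, other), 3))
```

Under these assumptions, Euclidean distance ranks `nearby` as the closer match to `base`, whereas cosine similarity ranks `scaled` as the more similar one, because the first measure looks at point-wise separation and the second only at direction.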
Pages: 1728-1737
Page count: 10
Related papers
50 in total
  • [1] EMP as a similarity measure: a geometric point of view
    Carbo-Dorca, Ramon
    Besalu, Emili
    JOURNAL OF MATHEMATICAL CHEMISTRY, 2013, 51 (01) : 382 - 389
  • [2] EMP as a similarity measure: a geometric point of view
    Ramon Carbó-Dorca
    Emili Besalú
    Journal of Mathematical Chemistry, 2013, 51 : 382 - 389
  • [3] Algorithms for computing geometric measures of melodic similarity
    Aloupis, Greg
    Fevens, Thomas
    Langerman, Stefan
    Matsui, Tomomi
    Mesa, Antonio
    Nunez, Yurai
    Rappaport, David
    Toussaint, Godfried
    COMPUTER MUSIC JOURNAL, 2006, 30 (03) : 67 - 76
  • [4] PICTURES OF RELEVANCE - A GEOMETRIC ANALYSIS OF SIMILARITY MEASURES
    JONES, WP
    FURNAS, GW
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1987, 38 (06): : 420 - 442
  • [5] Geometric similarity measures for the intuitionistic fuzzy sets
    Szmidt, Eulalia
    Kacprzyk, Janusz
    PROCEEDINGS OF THE 8TH CONFERENCE OF THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY (EUSFLAT-13), 2013, 32 : 840 - 847
  • [6] Comparison of similarity measures and clustering methods for time-series medical data mining
    Hirano, S
    Tsumoto, S
    DATA MINING AND KNOWLEDGE DISCOVERY: TOOLS AND TECHNOLOGY V, 2003, 5098 : 219 - 225
  • [7] Merging Clusters in Summary Structures for Data Stream Mining based on Fuzzy Similarity Measures
    Schick, Leonardo
    Lopes, Priscilla de Abreu
    Camargo, Heloisa A.
    PROCEEDINGS OF THE 11TH CONFERENCE OF THE EUROPEAN SOCIETY FOR FUZZY LOGIC AND TECHNOLOGY (EUSFLAT 2019), 2019, 1 : 812 - 819
  • [8] Similarity Measures for Intersection of Camera View Frustums
    Zamani, Yasin
    Shirzad, Hamed
    Kasaei, Shohreh
    2017 10TH IRANIAN CONFERENCE ON MACHINE VISION AND IMAGE PROCESSING (MVIP), 2017, : 171 - 175
  • [9] SIMILARITY MEASURES ON BINARY DATA
    JANOWITZ, MF
    SYSTEMATIC ZOOLOGY, 1980, 29 (04): : 342 - 359
  • [10] Similarity Measures for Multidimensional Data
    Baikousi, Eftychia
    Rogkakos, Georgios
    Vassiliadis, Panos
    IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 171 - 182