A Geometric View of Similarity Measures in Data Mining

被引:5
|
作者
Darvishi, A. [1 ]
Hassanpour, H. [1 ]
机构
[1] Univ Shahrood, Fac Comp Engn, Shahrood, Iran
来源
INTERNATIONAL JOURNAL OF ENGINEERING | 2015年 / 28卷 / 12期
关键词
Data Mining; Feature Extraction; Similarity Measures; Geometric View;
D O I
10.5829/idosi.ije.2015.28.12c.05
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The main objective of data mining is to acquire information from a set of data for prospect applications using a measure. The concerning issue is that one often has to deal with large scale data. Several dimensionality reduction techniques like various feature extraction methods have been developed to resolve the issue. However, the geometric view of the applied measure, as an additional consideration, is generally neglected. Since each measure has its own perspective to the data, different interpretations may achieved on data depending on the used measure. While efforts are often focused on adjusting the feature extraction techniques for mining the data, choosing a suitable measure regarding to the nature or general characteristics of the data or application is more appropriate. Given a couple of sequences, a specific measure may consider them as similar while another one may quantify them as dissimilar. The goal of this research is twofold: evincing the role of feature extraction in data mining and revealing the significance of similarity measures geometric attributes in detecting the relationships between data. Differrent similarity measures are also applied to three synthetic datasets and a real set of ECG time series to examine their performance.
引用
收藏
页码:1728 / 1737
页数:10
相关论文
共 50 条
  • [21] A microeconomic view of data mining
    Kleinberg, J
    Papadimitriou, C
    Raghavan, P
    DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (04) : 311 - 324
  • [22] Learning similarity measures from data
    Mathisen, Bjorn Magnus
    Aamodt, Agnar
    Bach, Kerstin
    Langseth, Helge
    PROGRESS IN ARTIFICIAL INTELLIGENCE, 2020, 9 (02) : 129 - 143
  • [23] Biclustering Similarity Measures for Heterogeneous Data
    Chaiwuttisak, Pornpimol
    Dhaenens, Clarisse
    Jourdan, Laetitia
    Vandromme, Maxence
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND TECHNOLOGY (ICAST'18), 2018, 2016
  • [24] Learning similarity measures from data
    Bjørn Magnus Mathisen
    Agnar Aamodt
    Kerstin Bach
    Helge Langseth
    Progress in Artificial Intelligence, 2020, 9 : 129 - 143
  • [25] Probabilistic vs. geometric similarity measures for image retrieval
    Aksoy, S
    Haralick, RM
    IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, VOL II, 2000, : 357 - 362
  • [26] JUDGED SIMILARITY AND BRAND-SWITCHING DATA AS SIMILARITY MEASURES
    LEHMANN, DR
    JOURNAL OF MARKETING RESEARCH, 1972, 9 (03) : 331 - 334
  • [27] A review of alignment based similarity measures for web usage mining
    Vinh-Trung Luu
    Germain Forestier
    Jonathan Weber
    Paul Bourgeois
    Fahima Djelil
    Pierre-Alain Muller
    Artificial Intelligence Review, 2020, 53 : 1529 - 1551
  • [28] A review of alignment based similarity measures for web usage mining
    Luu, Vinh-Trung
    Forestier, Germain
    Weber, Jonathan
    Bourgeois, Paul
    Djelil, Fahima
    Muller, Pierre-Alain
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (03) : 1529 - 1551
  • [29] Similarity discovery techniques in temporal data mining
    Pan, Ding
    Shen, Jun-Yi
    Ruan Jian Xue Bao/Journal of Software, 2007, 18 (02): : 246 - 258
  • [30] Mining on the Basis of Similarity in Graph and Image Data
    Srivastava, Vishal
    Biswas, Bhaskar
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, PT II, 2019, 956 : 193 - 203