Uncertain Data Mining: A Review of Optimization Methods for UK-Means

被引:0
|
作者
Aggarwal, Swati [1 ]
Agarwal, Nitika [1 ]
Jain, Monal [1 ]
机构
[1] Netaji Subhas Inst Technol, Dept Comp Engn, Delhi, India
关键词
clustering; uncertainty; pruning; uk-means;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Real world data generally deals with uncertainty. The complexity of data uncertainty poses many challenges. The most widely used k-means clustering algorithm is used for cluster analysis. However, it doesn't deal with uncertain data. The UK-means algorithm, a modification of k-means handles uncertain objects whose locations are represented by probability density functions (pdfs). It computes expected distances(EDs) between objects' pdf and cluster representatives using costly numerical integrations. This paper provides a review about the current state-of-the-art pruning techniques in order to improve the efficiency and countervail the computational complexity of UK-means based upon extensive research performed during the last decade. These are Min-Max bounding box (BB) based technique which uses two ways to calculate bounds: metric and trigonometric. Then, Voronoi based techniques to reduce ED calculations and indexing using R-Tree to reduce pruning overheads are discussed. Finally, hybrid techniques are observed to be feasible for real world employment.
引用
收藏
页码:3672 / 3677
页数:6
相关论文
共 50 条
  • [1] Data mining with various optimization methods
    Nedic, Vladimir
    Cvetanovic, Slobodan
    Despotovic, Danijela
    Despotovic, Milan
    Babic, Sasa
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (08) : 3993 - 3999
  • [2] A Review of Frequent Pattern Mining Algorithms for Uncertain Data
    Bhogadhi, Vani
    Chandak, M. B.
    [J]. PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 974 - 983
  • [3] Mining uncertain data
    Leung, Carson Kai-Sang
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 316 - 329
  • [4] A family of optimization based data mining methods
    Shi, Yong
    Liu, Rong
    Yan, Nian
    Chen, Zhenxing
    [J]. PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 26 - +
  • [5] A Review of Uncertain Decision-Making Methods in Energy Management Using Text Mining and Data Analytics
    Tavana, Madjid
    Shaabani, Akram
    Santos-Arteaga, Francisco Javier
    Vanani, Iman Raeesi
    [J]. ENERGIES, 2020, 13 (15)
  • [6] Review of Algorithm for Mining Frequent Patterns from Uncertain Data
    Yue, Liwen
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (06): : 17 - 21
  • [7] Trajectory data mining: A review of methods and applications
    Mazimpaka, Jean Damascene
    Timpf, Sabine
    [J]. JOURNAL OF SPATIAL INFORMATION SCIENCE, 2016, (13): : 61 - 99
  • [8] A review of data mining methods in financial markets
    Liu, Haihua
    Huang, Shan
    Wang, Peng
    Li, Zejun
    [J]. DATA SCIENCE IN FINANCE AND ECONOMICS, 2021, 1 (04): : 362 - 392
  • [9] Mining Uncertain Event Data in Process Mining
    Pegoraro, Marco
    van der Aalst, Wil M. P.
    [J]. 2019 INTERNATIONAL CONFERENCE ON PROCESS MINING (ICPM 2019), 2019, : 89 - 96
  • [10] A review and analysis on data mining methods to predict diabetes
    Ladha, Girdhar Gopal
    Pippal, Ravi Kumar Singh
    [J]. 2017 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2017, : 334 - 337