Parallel mining of uncertain data using segmentation of data set area and Voronoi diagrams

被引:0
|
作者
Lukic, Ivica [1 ]
Hocenski, Zeljko [1 ]
Kohler, Mirko [1 ]
Galba, Tomislav [1 ]
机构
[1] Josip Juraj Strossmayer Univ Osijek, Fac Elect Engn Comp Sci & Informat Technol Osijek, Dept Comp Engn & Automat, Osijek, Croatia
关键词
Clustering algorithms; data mining; data uncertainty; Euclidean distance; parallel algorithms;
D O I
10.1080/00051144.2018.1541645
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering of uncertain objects in large uncertain databases and problem of mining uncertain data has been well studied. In this paper, clustering of uncertain objects with location uncertainty is studied. Moving objects, like mobile devices, report their locations periodically, thus their locations are uncertain and best described by a probability density function. The number of objects in a database can be large which makes the process of mining accurate data, a challenging and time consuming task. Authors will give an overview of existing clustering methods and present a new approach for data mining and parallel computing of clustering problems. All existing methods use pruning to avoid expected distance calculations. It is required to calculate the expected distance numerical integration, which is time-consuming. Therefore, a new method, called Segmentation of Data Set Area-Parallel, is proposed. In this method, a data set area is divided into many small segments. Only clusters and objects in that segment are observed. The number of segments is calculated using the number and location of clusters. The use of segments gives the possibility of parallel computing, because segments are mutually independent. Thus, each segment can be computed on multiple cores.
引用
收藏
页码:349 / 356
页数:8
相关论文
共 50 条
  • [1] Clustering Uncertain Data using Voronoi Diagrams
    Kao, Ben
    Lee, Sau Dan
    Cheung, David W.
    Ho, Wai-Shing
    Chan, K. F.
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 333 - 342
  • [2] Clustering Uncertain Data Using Voronoi Diagrams and R-Tree Index
    Kao, Ben
    Lee, Sau Dan
    Lee, Foris K. F.
    Cheung, David Wai-lok
    Ho, Wai-Shing
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (09) : 1219 - 1233
  • [3] Parallel Clustering of large data set on Hadoop using Data mining techniques
    Chaturbhuj, Kaustubh S.
    Chaudhary, Gauri
    2016 WORLD CONFERENCE ON FUTURISTIC TRENDS IN RESEARCH AND INNOVATION FOR SOCIAL WELFARE (STARTUP CONCLAVE), 2016,
  • [4] Map segmentation for geospatial data mining through generalized higher-order Voronoi diagrams with sequential scan algorithms
    Lee, Ickjai
    Torpelund-Bruin, Christopher
    Lee, Kyungmi
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (12) : 11135 - 11148
  • [5] Urban data visualization with Voronoi Diagrams
    Abellanas, Manuel
    Palop, Belen
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2008, PT 1, PROCEEDINGS, 2008, 5072 : 126 - +
  • [6] Parallel SVM for large data-set mining
    Qian, L
    Hung, T
    DATA MINING IV, 2004, 7 : 661 - 670
  • [7] Accuracy of Estimating the Area of Cortical Muscle Representations from TMS Mapping Data Using Voronoi Diagrams
    Andrey Yu. Chernyavskiy
    Dmitry O. Sinitsyn
    Alexandra G. Poydasheva
    Ilya S. Bakulin
    Natalia A. Suponeva
    Michael A. Piradov
    Brain Topography, 2019, 32 : 859 - 872
  • [8] Accuracy of Estimating the Area of Cortical Muscle Representations from TMS Mapping Data Using Voronoi Diagrams
    Chernyayskiy, Andrey Yu
    Sinitsyn, Dmitry O.
    Poydasheva, Alexandra G.
    Bakulin, Ilya S.
    Suponeva, Natalia A.
    Piradov, Michael A.
    BRAIN TOPOGRAPHY, 2019, 32 (05) : 859 - 872
  • [9] Mining uncertain data
    Leung, Carson Kai-Sang
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 1 (04) : 316 - 329
  • [10] Trajectory Mining Using Uncertain Sensor Data
    Muzammal, Muhammad
    Gohar, Moneeb
    Rahman, Arif Ur
    Qu, Qiang
    Ahmad, Awais
    Jeon, Gwanggil
    IEEE ACCESS, 2018, 6 : 4895 - 4903