Parallel mining of uncertain data using segmentation of data set area and Voronoi diagrams

Cited by: 0
Authors
Lukic, Ivica [1]
Hocenski, Zeljko [1]
Kohler, Mirko [1]
Galba, Tomislav [1]
Affiliations
[1] Josip Juraj Strossmayer Univ Osijek, Fac Elect Engn Comp Sci & Informat Technol Osijek, Dept Comp Engn & Automat, Osijek, Croatia
Keywords
Clustering algorithms; data mining; data uncertainty; Euclidean distance; parallel algorithms
DOI
10.1080/00051144.2018.1541645
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Clustering of uncertain objects in large uncertain databases, and the broader problem of mining uncertain data, have been well studied. This paper studies the clustering of uncertain objects with location uncertainty. Moving objects, such as mobile devices, report their locations only periodically, so their locations are uncertain and are best described by a probability density function. The number of objects in a database can be large, which makes mining accurate data a challenging and time-consuming task. The authors give an overview of existing clustering methods and present a new approach to data mining and parallel computing of clustering problems. All existing methods use pruning to avoid expected distance calculations, because calculating an expected distance requires numerical integration, which is time-consuming. Therefore, a new method, called Segmentation of Data Set Area-Parallel, is proposed. In this method, the data set area is divided into many small segments, and for each segment only the clusters and objects in that segment are considered. The number of segments is calculated from the number and location of clusters. Because the segments are mutually independent, they can be computed in parallel on multiple cores.
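The segment-wise idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and it rests on several assumptions. Each uncertain object is represented by a list of 2D samples drawn from its probability density function. The expected distance is approximated by averaging over those samples instead of by numerical integration. The data set area is cut into a uniform grid with a hypothetical `cell_size` parameter, whereas the paper derives the number of segments from the number and location of clusters. A segment that contains no cluster centre falls back to all centres. Pruning is omitted. All function names are hypothetical.

```python
import math
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def expected_distance(samples, center):
    """Sample-based estimate of the expected Euclidean distance to a centre."""
    return sum(math.dist(s, center) for s in samples) / len(samples)


def cell_of(point, cell_size):
    """Grid cell (segment) that contains a 2D point."""
    return (int(point[0] // cell_size), int(point[1] // cell_size))


def mean_point(samples):
    """Mean location of an uncertain object, used to pick its segment."""
    n = len(samples)
    return (sum(x for x, _ in samples) / n, sum(y for _, y in samples) / n)


def assign_segment(task):
    """Assign every object of one segment to its nearest candidate centre."""
    objects, centers = task
    return {
        name: min(centers, key=lambda cid: expected_distance(samples, centers[cid]))
        for name, samples in objects.items()
    }


def assign_all(objects, centers, cell_size, workers=4):
    """Split objects and centres by segment, then process segments independently."""
    seg_objects = defaultdict(dict)
    for name, samples in objects.items():
        seg_objects[cell_of(mean_point(samples), cell_size)][name] = samples

    seg_centers = defaultdict(dict)
    for cid, center in centers.items():
        seg_centers[cell_of(center, cell_size)][cid] = center

    # Assumption: a segment without any centre considers all centres instead.
    tasks = [(objs, seg_centers.get(seg) or centers)
             for seg, objs in seg_objects.items()]

    result = {}
    # Threads keep the sketch simple; CPU-bound work would normally use processes.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for part in pool.map(assign_segment, tasks):
            result.update(part)
    return result


objects = {
    "a": [(1.0, 1.0), (1.2, 0.8)],
    "b": [(9.0, 9.0), (8.8, 9.1)],
    "c": [(6.0, 1.0)],
}
centers = {"c0": (0.5, 0.5), "c1": (9.5, 9.5)}
assignment = assign_all(objects, centers, cell_size=5.0)
```

Because each task carries only the objects and centres of its own segment, no task reads data belonging to another segment. That independence is what allows the segments to be handed to separate workers.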
Pages: 349-356
Number of pages: 8