Parallel mining of uncertain data using segmentation of data set area and Voronoi diagrams

被引:0
|
作者
Lukic, Ivica [1 ]
Hocenski, Zeljko [1 ]
Kohler, Mirko [1 ]
Galba, Tomislav [1 ]
机构
[1] Josip Juraj Strossmayer Univ Osijek, Fac Elect Engn Comp Sci & Informat Technol Osijek, Dept Comp Engn & Automat, Osijek, Croatia
关键词
Clustering algorithms; data mining; data uncertainty; Euclidean distance; parallel algorithms;
D O I
10.1080/00051144.2018.1541645
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering of uncertain objects in large uncertain databases and problem of mining uncertain data has been well studied. In this paper, clustering of uncertain objects with location uncertainty is studied. Moving objects, like mobile devices, report their locations periodically, thus their locations are uncertain and best described by a probability density function. The number of objects in a database can be large which makes the process of mining accurate data, a challenging and time consuming task. Authors will give an overview of existing clustering methods and present a new approach for data mining and parallel computing of clustering problems. All existing methods use pruning to avoid expected distance calculations. It is required to calculate the expected distance numerical integration, which is time-consuming. Therefore, a new method, called Segmentation of Data Set Area-Parallel, is proposed. In this method, a data set area is divided into many small segments. Only clusters and objects in that segment are observed. The number of segments is calculated using the number and location of clusters. The use of segments gives the possibility of parallel computing, because segments are mutually independent. Thus, each segment can be computed on multiple cores.
引用
收藏
页码:349 / 356
页数:8
相关论文
共 50 条
  • [31] The impact of big data market segmentation using data mining and clustering techniques
    Yoseph, Fahed
    Malim, Nurul Hashimah Ahamed Hassain
    Heikkila, Markku
    Brezulianu, Adrian
    Geman, Oana
    Rostam, Nur Aqilah Paskhal
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (05) : 6159 - 6173
  • [32] Data structures and algorithms to support interactive spatial analysis using dynamic Voronoi diagrams
    Gahegan, M.
    Lee, I.
    Computers, Environment and Urban Systems, 2000, 24 (06) : 509 - 537
  • [33] Data mining using extensions of the rough set model
    Lingras, PJ
    Yao, YY
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1998, 49 (05): : 415 - 422
  • [34] Data mining on views in a parallel data server
    Sarkar, S
    Sarkar, S
    PROCEEDINGS OF THE HIGH-PERFORMANCE COMPUTING (HPC'98), 1998, : 133 - 138
  • [35] Voronoi Diagrams Based Digital Tattoo for Multimedia Data Protection
    Jana, Sharmistha
    Jana, Biswapati
    Singh, Prabhash Kumar
    Bera, Prasenjit
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2021, 2022, 1534 : 777 - 793
  • [36] Parallel Processing of Image Segmentation Data Using Hadoop
    Akhtar, M. Nishat
    Saleh, Junita Mohamad
    Grelck, C.
    INTERNATIONAL JOURNAL OF INTEGRATED ENGINEERING, 2018, 10 (01): : 74 - 84
  • [37] Data Structures for Halfplane Proximity Queries and Incremental Voronoi Diagrams
    Boris Aronov
    Prosenjit Bose
    Erik D. Demaine
    Joachim Gudmundsson
    John Iacono
    Stefan Langerman
    Michiel Smid
    Algorithmica, 2018, 80 : 3316 - 3334
  • [38] Reaction Diffusion Voronoi Diagrams: From Sensors Data to Computing
    Vazquez-Otero, Alejandro
    Faigl, Jan
    Dormido, Raquel
    Duro, Natividad
    SENSORS, 2015, 15 (06) : 12736 - 12764
  • [39] Data Structures for Halfplane Proximity Queries and Incremental Voronoi Diagrams
    Aronov, Boris
    Bose, Prosenjit
    Demaine, Erik D.
    Gudmundsson, Joachim
    Iacono, John
    Langerman, Stefan
    Smid, Michiel
    ALGORITHMICA, 2018, 80 (11) : 3316 - 3334
  • [40] Data structures for halfplane proximity queries and incremental Voronoi diagrams
    Aronov, B
    Bose, P
    Demaine, ED
    Gudmundsson, J
    Iacono, J
    Langerman, S
    Smid, M
    LATIN 2006: THEORETICAL INFORMATICS, 2006, 3887 : 80 - 92